Gene Namu_1772 details


Gene Information

Locus tag: Namu_1772
Symbol:
ID: 8447374
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 1942767
End bp: 1943927
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 69%
IMG OID: 645040898
Product: ROK family protein
Protein accession: YP_003201151
Protein GI: 258651995
COG category: [G] Carbohydrate transport and metabolism
              [K] Transcription
COG ID: [COG1940] Transcriptional regulator/sugar kinase
TIGRFAM ID:
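
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon (the gene sequence on this page ends in TGA). The variable names are made up for the example and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1942767
end_bp = 1943927
gene_length_bp = 1161
protein_length_aa = 386

# Assumption: coordinates are 1-based and inclusive, so both end points count.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as an amino acid.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under those assumptions the three checks agree with the listed values: 1161 bp is 387 codons, which is 386 residues plus one stop codon.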


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.415249
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.404833
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGAGC GGACCGAGTC CTTTCCAGCC CTGTCCGAGG GCTCCCGGGC CGCGCTCGTC 
CGGCTGCTGG TCCATGGCCC GGCGTCCCGG GCCGATCTGG CCCGGCGGCT TCGTCTCTCA
CCCGCGAGCC TGACCCGGAT CGTGCGCACG CTGGAGGACA GCGGCCTCGT GGTGGAGTCC
GAGACCACCG TGCCGCAGCG GATGGGTCGC CCGTCCCAGG CGATGGAAGT CAACGTTGAC
GCCGTCCACC TCGTCGGCAT CAAGCTCCTG GCCGGCGAGA TCAATCTGGT TCGCACCGAC
ATGCGCAGCA CGGTGCTCGG CCACCGGACC ATCCCTCTGC AGACCGTCCC GCTGGATCCG
GCCATCGACC GGATCACCGA AGCCATCCTG GCCGAGGTGG CGGTCGATCC GGCCGTCGGA
GCAGTGGGGA TCAGCCTGGC CGGGCCGGTG GATCCCAGTT CGGACATGGT CACCCACTCC
CCTTTTCTCG GCTGGGAGGA CGTACCCCTG GCTCGCTTGG TCAGCGAGCG AACGGGATTG
CCCACCGTGA TCGAAAATGA CGTGCGGGCG CTGACCGCCG CCGAGCATTG GTTCGGCGCA
GCCGCGGGCG CAACCGATTT CGTCTTGGTC ACCATCGGCG CGGGCATCGG CTGCGGGGTC
GTGATCGGTG ACCGGTTGGT TGACGGCAAC ACCGGTGGCG CAGGCCAGAT CGGGCACCTG
CCGATCACCC CGTCCGGGCC GTTGTGCGAA CGTGGTCATC GCGGCTGCGC CCGGTCGTAC
TTGGCGTCCT CGGCGATGGT CGGACAGGCG TCCATGGCCC TGCACCGGCC CGATCTCACC
TATGCCGAAC TCGTGTCGTT GGCCCACCAG GGCGAGCGGG TCGCCAGCCG GGTCGTGCGG
GATGCCGGCT ACGCGTTGGG CACCCTGATC GGACTGGTGA CCGCGATCAC CGCGCCCAGC
AAGGTGATCA TCTCCGGTGA GGGGGTCACG ATGGTCCCGC TGGTCATGGA CGTCGTGCAG
GAGCGGGCCA GCGAGGTCGA ACACTGGGCC GTGCCCGATG TTCCCATCGA GATTGCCGAA
TTCGGCTTCG TGGAATGGGC CCGCGGCGCT GCCGTCATTG CCCTGCAGCA ACTACTGGAA
GCGGCCATTA GCCCCGCCTG A
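
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction. The page does not state how its GC content figure was derived, so this is an assumed definition (count of G and C divided by sequence length), and the helper names `clean_sequence` and `gc_fraction` are hypothetical. Only the first printed line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in blocks by removing all whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (assumed definition)."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First printed line of the gene sequence, copied as shown on the page.
first_line = "ATGTCCGAGC GGACCGAGTC CTTTCCAGCC CTGTCCGAGG GCTCCCGGGC CGCGCTCGTC "

seq = clean_sequence(first_line)
print("bases:", len(seq))
print(f"GC fraction of this line: {gc_fraction(seq):.2f}")
```

To compare with the GC content field, pass all lines of the gene sequence through `clean_sequence` in one string and call `gc_fraction` on the result.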
 
Protein sequence
MSERTESFPA LSEGSRAALV RLLVHGPASR ADLARRLRLS PASLTRIVRT LEDSGLVVES 
ETTVPQRMGR PSQAMEVNVD AVHLVGIKLL AGEINLVRTD MRSTVLGHRT IPLQTVPLDP
AIDRITEAIL AEVAVDPAVG AVGISLAGPV DPSSDMVTHS PFLGWEDVPL ARLVSERTGL
PTVIENDVRA LTAAEHWFGA AAGATDFVLV TIGAGIGCGV VIGDRLVDGN TGGAGQIGHL
PITPSGPLCE RGHRGCARSY LASSAMVGQA SMALHRPDLT YAELVSLAHQ GERVASRVVR
DAGYALGTLI GLVTAITAPS KVIISGEGVT MVPLVMDVVQ ERASEVEHWA VPDVPIEIAE
FGFVEWARGA AVIALQQLLE AAISPA