Gene Ent638_1564 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_1564 
SymbolmdoG 
ID5114534 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp1717942 
End bp1719543 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content51% 
IMG OID640491753 
Productglucan biosynthesis protein G 
Protein accessionYP_001176294 
Protein GI146311220 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3131] Periplasmic glucans biosynthesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000805878 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.762961 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCGGA TCGAATTAAG CACACAAAGG GGGAAGTGCT TACTTATTAT GAAACATAAA 
TCAAAAATGA TGAAAATGCG TTGGTTGGGT GCGGCAGTGT TGTTGTCCCT GTATACCTCG
TCAGCATGGT CTTTTTCCAT CGACGACGTC GCAAAACAGG CTAAATCGAT GGCTGATAAG
GGCTACGAAG CGCCGAAAAG TAACCTGCCC TCCGTTTTCC GCGACATGAA ATACGCGGAC
TATCAGCAGA TCCAGTTTAA TCACGATAAA GCGTACTGGA ACAATATAAA AACCCCGTTC
AAGCTTGAAT TTTATCACCA GGGTATGTAT TTCGACACGC CGGTTGCCAT CAATGAAGTG
ACGGCAACTG CGGTACGTAA GATCAAATAC AGCCCGGACT ATTTTAATTT TGGCGATGTG
AAGCACGATA AAGACACCGT GAAAGACCTG GGTTTTGCAG GCTTTAAAGT TCTTTATCCA
ATCAATAGCA AAGATAAAAA CGACGAAATC GTCAGCATGC TGGGCGCCAG CTATTTCCGT
GTGATTGGCT CCGGCCAGGT TTACGGTCTG TCTGCGCGCG GTCTGGCGAT TGATACGGCT
CTGCCATCTG GCGAAGAATT CCCGCGTTTC CGTGAGTTCT GGATCGAACG TCCAAAACCA
ACGGATAAAC GCCTGACCAT CTATGCACTG CTCGACTCCC CGCGTGCGAC GGGCGCTTAC
CGCTTCGTCA TTATGCCGGG TCGTGACTCC GTTATTGACG TGCAGTCGAA AGTCTACCTG
CGTGACAAAG TGGGCAAACT GGGTGTTGCA CCATTAACCA GCATGTTCCT GTTTGGGCCA
AATCAGCCTT CGCCGGCAAC GAATTTCCGC CCGGAACTGC ATGATTCAAA CGGTCTTTCT
ATTCTTGCCG GTAACGGCGA GTGGATTTGG CGTCCGCTGA ATAACCCGAA ACACCTGGCT
GTCAGCAGCT TCGCAATGGA AAACCCGCAG GGCTTTGGTT TACTCCAGCG CGGCCGTCAG
TTCTCCCGCT TTGAAGATCT CGACGATCGT TACGATCTGC GTCCAAGTGC GTGGGTTACG
CCAAAAGGTG ATTGGGGTAA AGGAAAAGTC GAGCTGGTTG AAATTCCAAC CAACGACGAA
ACCAATGACA ACATCGTCGC TTACTGGGCT CCGGATCAGC TTCCTGAGGC CGGCAAAGAG
ATGAACTTCA ACTACACCAT CACCTTTAGC CGCGATGAAG ACAAATTGCA TGCGCCGGAC
AACGCGTATG TGATGCAGAC GCGTCGCTCT ACGGGTGACG TGAAGCAGTC AAATCTAATT
CGTCAACCTG ATGGTACCGT CGCGTTTGTG GTGGACTTCA CGGGTCAGGA CATGAAAAAA
CTGGCGCCGG ACACCCCTGT GACTGCGCAA ACCAGCATTG GTGATAATGG CGAGATCGCG
GAAAGCTCCG TACGTTATAA CCCAGTAACC AAAGGGTGGC GCTTAACCCT GCGCGTGAAA
GTGAAAGATC CAAAACAGAC CACTGAAATG CGCGCTGCGC TGGTCAATGC CGATCAGCCG
CTAAGCGAAA CCTGGAGCTA TCAGTTACCT GCTAATGAAT AA
 
Protein sequence
MDRIELSTQR GKCLLIMKHK SKMMKMRWLG AAVLLSLYTS SAWSFSIDDV AKQAKSMADK 
GYEAPKSNLP SVFRDMKYAD YQQIQFNHDK AYWNNIKTPF KLEFYHQGMY FDTPVAINEV
TATAVRKIKY SPDYFNFGDV KHDKDTVKDL GFAGFKVLYP INSKDKNDEI VSMLGASYFR
VIGSGQVYGL SARGLAIDTA LPSGEEFPRF REFWIERPKP TDKRLTIYAL LDSPRATGAY
RFVIMPGRDS VIDVQSKVYL RDKVGKLGVA PLTSMFLFGP NQPSPATNFR PELHDSNGLS
ILAGNGEWIW RPLNNPKHLA VSSFAMENPQ GFGLLQRGRQ FSRFEDLDDR YDLRPSAWVT
PKGDWGKGKV ELVEIPTNDE TNDNIVAYWA PDQLPEAGKE MNFNYTITFS RDEDKLHAPD
NAYVMQTRRS TGDVKQSNLI RQPDGTVAFV VDFTGQDMKK LAPDTPVTAQ TSIGDNGEIA
ESSVRYNPVT KGWRLTLRVK VKDPKQTTEM RAALVNADQP LSETWSYQLP ANE