Gene EcSMS35_3509 details


Gene Information

Locus tag: EcSMS35_3509
Symbol: gltD
ID: 6146884
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3585810
End bp: 3587228
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 56%
IMG OID: 641618338
Product: glutamate synthase subunit beta
Protein accession: YP_001745485
Protein GI: 170682339
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG0493] NADPH-dependent glutamate synthase beta chain and related oxidoreductases
TIGRFAM ID: [TIGR01318] glutamate synthase small subunit family protein, proteobacterial
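
The Start bp, End bp, Gene Length, and Protein Length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive (an assumption, though it matches the stated length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3585810
end_bp = 3587228

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be a stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1419 472
```

Both results agree with the Gene Length (1419 bp) and Protein Length (472 aa) fields.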


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGTCAGA ATGTTTATCA ATTTATCGAC CTGCAGCGCG TTGATCCGCC AAAGAAACCG 
CTGAAGATCC GCAAAATTGA GTTTGTTGAA ATTTACGAGC CGTTTTCCGA AGGCCAGGCC
AAAGCGCAGG CTGACCGCTG CCTGTCGTGC GGCAACCCAT ACTGCGAGTG GAAATGCCCG
GTACACAACT ACATCCCGAA CTGGCTGAAG CTCGCCAACG AGGGGCGTAT TTTTGAAGCG
GCGGAACTGT CGCATCAGAC CAATACCCTG CCGGAAGTGT GCGGACGTGT CTGCCCGCAA
GACCGTCTGT GCGAAGGTTC CTGCACCCTG AACGATGAGT TCGGCGCAGT GACCATCGGC
AACATTGAGC GCTATATCAA CGATAAAGCG TTCGAGATGG GCTGGCGTCC GGATATGTCC
GGCGTGAAAC AGACCGGTAA AAAAGTGGCG ATTATCGGCG CAGGCCCGGC GGGTCTGGCG
TGTGCAGATG TCCTGACGCG CAACGGCGTA AAAGCAGTCG TCTTCGACCG TCACCCGGAA
ATCGGCGGCT TGCTGACCTT CGGTATTCCG GCCTTCAAGC TGGAAAAAGA GGTAATGACG
CGCCGCCGTG AAATCTTCAC GGGCATGGGT ATTGAATTCA AACTCAATAC CGAAGTGGGC
CGCGACGTAC AGCTGGACGA TCTGCTGAGT GATTACGATG CCGTGTTCCT TGGCGTCGGG
ACTTATCAGT CAATGCGCGG CGGGCTGGAA AACGAAGACG CCGATGGCGT GTACGCAGCG
CTGCCGTTCC TTATCGCCAA CACCAAACAG TTAATGGGCT TTGGCGAAAC CAACGACGAA
CCGTTCGTCA GCATGGAAGG CAAACGCGTG GTGGTCCTTG GCGGTGGCGA CACTGCGATG
GACTGCGTGC GCACGTCTGT ACGCCAGGGG GCAAAGCACG TTACCTGTGC CTATCGTCGT
GATGAAGAGA ACATGCCGGG TTCCCGCCGC GAAGTGAAAA ACGCGCGGGA AGAAGGTGTT
GAGTTCAAAT TCAACGTCCA GCCGCTGGGT ATAGAAGTGA ACGGTAACGG CAAAGTCAGC
GGCGTAAAAA TGGTGCGCAC AGAAATGGGC GAACCGGATG CCAAAGGCCG TCGCCGCGCA
GAGATCGTGG CAGGGTCAGA ACATATCGTT CCGGCAGATG CGGTGATCAT GGCGTTTGGT
TTCCGTCCAC ACAGCATGGA ATGGCTGGCA AAACACAGCG TCGAGCTGGA TTCGCAGGGG
CGCATTATCG CCCCGGAAGG CAACGACAAC GCTTTCCAGA CCAGCAACCC GAAAATCTTT
GCTGGCGGCG ATATCGTCCG TGGTTCCGAT CTGGTCGTTA CCGCCATTGC CGAAGGTCGC
AAAGCGGCAG ACGGCATTAT GAACTGGCTG GAAGTTTAA
 
Protein sequence
MSQNVYQFID LQRVDPPKKP LKIRKIEFVE IYEPFSEGQA KAQADRCLSC GNPYCEWKCP 
VHNYIPNWLK LANEGRIFEA AELSHQTNTL PEVCGRVCPQ DRLCEGSCTL NDEFGAVTIG
NIERYINDKA FEMGWRPDMS GVKQTGKKVA IIGAGPAGLA CADVLTRNGV KAVVFDRHPE
IGGLLTFGIP AFKLEKEVMT RRREIFTGMG IEFKLNTEVG RDVQLDDLLS DYDAVFLGVG
TYQSMRGGLE NEDADGVYAA LPFLIANTKQ LMGFGETNDE PFVSMEGKRV VVLGGGDTAM
DCVRTSVRQG AKHVTCAYRR DEENMPGSRR EVKNAREEGV EFKFNVQPLG IEVNGNGKVS
GVKMVRTEMG EPDAKGRRRA EIVAGSEHIV PADAVIMAFG FRPHSMEWLA KHSVELDSQG
RIIAPEGNDN AFQTSNPKIF AGGDIVRGSD LVVTAIAEGR KAADGIMNWL EV
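
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup plus a GC-percentage calculation; `clean_sequence` and `gc_percent` are hypothetical helper names, not part of any tool associated with this record, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments drops spaces, tabs, and newlines.
    return "".join(block_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence from this record, used as sample input.
first_line = (
    "ATGAGTCAGA ATGTTTATCA ATTTATCGAC CTGCAGCGCG TTGATCCGCC AAAGAAACCG"
)
fragment = clean_sequence(first_line)

print(len(fragment))          # 60 bases in the first line
print(fragment[:3])           # ATG start codon
print(gc_percent(fragment))   # GC percentage of this fragment only
```

Running the same two helpers over the full gene sequence should give a length matching the Gene Length field and a GC percentage comparable to the GC content field, which is reported as a rounded value.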