Gene EcolC_0487 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0487 
SymbolgltD 
ID6068533 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp527744 
End bp529162 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content56% 
IMG OID641599892 
Productglutamate synthase subunit beta 
Protein accessionYP_001723491 
Protein GI170018537 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0493] NADPH-dependent glutamate synthase beta chain and related oxidoreductases 
TIGRFAM ID[TIGR01318] glutamate synthase small subunit family protein, proteobacterial 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCAGA ATGTTTATCA ATTTATCGAC CTGCAGCGCG TTGATCCGCC AAAGAAACCG 
CTGAAGATCC GCAAAATTGA GTTTGTTGAA ATTTACGAGC CGTTTTCCGA AGGCCAGGCC
AAAGCGCAGG CTGACCGCTG CCTGTCGTGC GGCAACCCAT ACTGCGAGTG GAAATGCCCG
GTACACAACT ACATCCCGAA CTGGCTGAAG CTAGCCAACG AGGGGCGTAT TTTTGAAGCG
GCGGAACTGT CGCACCAGAC CAACACCCTG CCGGAAGTTT GCGGACGAGT CTGCCCGCAA
GACCGTCTGT GCGAAGGTTC CTGCACTCTG AACGATGAGT TTGGCGCGGT GACCATCGGC
AACATTGAGC GCTATATCAA CGATAAAGCG TTCGAGATGG GCTGGCGTCC GGATATGTCT
GGTGTGAAAC AGACCGGTAA AAAAGTGGCG ATTATCGGCG CAGGCCCGGC AGGTCTGGCG
TGTGCGGATG TCCTGACGCG TAACGGCGTA AAAGCCGTTG TCTTCGACCG TCATCCAGAA
ATTGGCGGGC TGCTGACCTT CGGTATTCCG GCCTTCAAGC TGGAAAAAGA GGTAATGACG
CGTCGCCGTG AAATCTTCAC CGGCATGGGT ATTGAATTCA AACTCAATAC CGAAGTGGGC
CGCGACGTGC AGCTGGACGA TCTGCTGAGT GATTACGATG CCGTCTTCCT TGGCGTCGGG
ACTTATCAGT CAATGCACGG CGGGCTGGAA AACGAAGACG CCGATGGCGT GTACGCAGCG
CTGCCGTTCC TTATCGCCAA CACCAAACAG TTAATGGGCT TTGGTGAAAC CCGCGACGAA
CCGTTCGTCA GCATGGAAGG CAAACGCGTG GTGGTCCTTG GCGGTGGCGA CACTGCGATG
GACTGCGTGC GTACGTCCGT GCGCCAGGGA GCGAAGCACG TTACCTGTGC CTATCGTCGT
GATGAAGAGA ACATGCCGGG TTCCCGCCGC GAAGTGAAAA ACGCGCGGGA AGAAGGCGTA
GAGTTCAAAT TCAACGTCCA GCCACTGGGG ATTGAAGTGA ACGGTAACGG CAAAGTCAGC
GGCGTAAAAA TGGTACGTAC CGAAATGGGC GAACCGGATG CCAAAGGCCG TCGCCGCGCG
GAGATCGTTG CAGGTTCCGA ACATATCGTT CCGGCAGATG CGGTGATCAT GGCGTTTGGT
TTCCGTCCAC ACAACATGGA ATGGCTGGCA AAACACAGCG TCGAGCTGGA TTCACAAGGC
CGCATCATCG CCCCGGAAGG CAGCGACAAC GCCTTCCAGA CCAGCAACCC GAAAATCTTT
GCTGGCGGCG ATATCGTCCG TGGTTCCGAT CTGGTGGTAA CCGCTATTGC CGAAGGTCGT
AAGGCGGCAG ACGGTATTAT GAACTGGCTG GAAGTTTAA
 
Protein sequence
MSQNVYQFID LQRVDPPKKP LKIRKIEFVE IYEPFSEGQA KAQADRCLSC GNPYCEWKCP 
VHNYIPNWLK LANEGRIFEA AELSHQTNTL PEVCGRVCPQ DRLCEGSCTL NDEFGAVTIG
NIERYINDKA FEMGWRPDMS GVKQTGKKVA IIGAGPAGLA CADVLTRNGV KAVVFDRHPE
IGGLLTFGIP AFKLEKEVMT RRREIFTGMG IEFKLNTEVG RDVQLDDLLS DYDAVFLGVG
TYQSMHGGLE NEDADGVYAA LPFLIANTKQ LMGFGETRDE PFVSMEGKRV VVLGGGDTAM
DCVRTSVRQG AKHVTCAYRR DEENMPGSRR EVKNAREEGV EFKFNVQPLG IEVNGNGKVS
GVKMVRTEMG EPDAKGRRRA EIVAGSEHIV PADAVIMAFG FRPHNMEWLA KHSVELDSQG
RIIAPEGSDN AFQTSNPKIF AGGDIVRGSD LVVTAIAEGR KAADGIMNWL EV