Gene EcHS_A3406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3406 
SymbolgltD 
ID5593362 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3410125 
End bp3411543 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content56% 
IMG OID640922527 
Productglutamate synthase subunit beta 
Protein accessionYP_001460015 
Protein GI157162697 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0493] NADPH-dependent glutamate synthase beta chain and related oxidoreductases 
TIGRFAM ID[TIGR01318] glutamate synthase small subunit family protein, proteobacterial 


Plasmid Coverage information

Num covering plasmid clones72 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCAGA ATGTTTATCA ATTTATCGAC CTGCAGCGCG TTGATCCGCC AAAGAAACCG 
CTGAAGATCC GCAAAATTGA GTTTGTTGAA ATTTACGAGC CGTTTTCCGA AGGCCAGGCC
AAAGCGCAGG CTGACCGCTG CCTGTCGTGC GGCAACCCAT ACTGCGAGTG GAAATGCCCG
GTACACAACT ACATCCCGAA CTGGCTGAAG CTCGCCAACG AGGGGCGTAT TTTTGAAGCG
GCGGAACTGT CGCACCAGAC CAACACCCTG CCGGAAGTTT GCGGACGAGT CTGCCCGCAA
GACCGTCTGT GCGAAGGTTC CTGCACTCTG AACGATGAGT TTGGCGCGGT GACCATCGGC
AACATTGAGC GCTATATCAA CGATAAAGCG TTCGAGATGG GCTGGCGTCC GGATATGTCT
GGTGTGAAAC AGACCGGTAA AAAAGTGGCG ATTATCGGCG CAGGCCCGGC AGGTCTGGCG
TGTGCGGATG TCCTGACGCG TAACGGCGTA AAAGCCGTTG TCTTCGACCG TCATCCAGAA
ATTGGCGGGC TGCTGACCTT CGGTATTCCG GCCTTCAAGC TGGAAAAAGA GGTAATGACG
CGTCGCCGTG AAATCTTCAC CGGCATGGGT ATTGAATTCA AACTCAATAC CGAAGTGGGC
CGCGACGTGC AGCTGGACGA TCTGCTGAGT GATTACGATG CCGTCTTCCT TGGCGTCGGG
ACTTATCAGT CAATGCACGG CGGGCTGGAA AACGAAGACG CCGATGGCGT GTACGCAGCG
CTGCCGTTCC TTATCGCCAA CACCAAACAG TTAATGGGCT TTGGTGAAAC CCGCGACGAA
CCGTTCGTCA GCATGGAAGG CAAACGCGTG GTGGTCCTTG GCGGTGGCGA CACTGCGATG
GACTGCGTGC GTACGTCCGT GCGCCAGGGA GCGAAGCACG TTACCTGTGC CTATCGTCGT
GATGAAGAGA ACATGCCGGG TTCCCGCCGC GAAGTGAAAA ACGCGCGGGA AGAAGGCGTA
GAGTTCAAAT TCAACGTCCA GCCACTGGGG ATTGAAGTGA ACGGTAACGG CAAAGTCAGC
GGCGTAAAAA TGGTACGTAC CGAAATGGGC GAACCGGATG CCAAAGGCCG TCGCCGCGCG
GAGATCGTTG CAGGTTCCGA ACATATCGTT CCGGCAGATG CGGTGATCAT GGCGTTTGGT
TTCCGTCCAC ACAACATGGA ATGGCTGGCA AAACACAGCG TCGAGCTGGA TTCACAAGGC
CGCATCATCG CCCCGGAAGG CAGCGACAAC GCCTTCCAGA CCAGCAACCC GAAAATCTTT
GCTGGCGGCG ATATCGTCCG TGGTTCCGAT CTGGTGGTAA CCGCTATTGC CGAAGGTCGT
AAGGCGGCAG ACGGTATTAT GAACTGGCTG GAAGTTTAA
 
Protein sequence
MSQNVYQFID LQRVDPPKKP LKIRKIEFVE IYEPFSEGQA KAQADRCLSC GNPYCEWKCP 
VHNYIPNWLK LANEGRIFEA AELSHQTNTL PEVCGRVCPQ DRLCEGSCTL NDEFGAVTIG
NIERYINDKA FEMGWRPDMS GVKQTGKKVA IIGAGPAGLA CADVLTRNGV KAVVFDRHPE
IGGLLTFGIP AFKLEKEVMT RRREIFTGMG IEFKLNTEVG RDVQLDDLLS DYDAVFLGVG
TYQSMHGGLE NEDADGVYAA LPFLIANTKQ LMGFGETRDE PFVSMEGKRV VVLGGGDTAM
DCVRTSVRQG AKHVTCAYRR DEENMPGSRR EVKNAREEGV EFKFNVQPLG IEVNGNGKVS
GVKMVRTEMG EPDAKGRRRA EIVAGSEHIV PADAVIMAFG FRPHNMEWLA KHSVELDSQG
RIIAPEGSDN AFQTSNPKIF AGGDIVRGSD LVVTAIAEGR KAADGIMNWL EV