Gene Ent638_1698 details


Gene Information

Locus tag: Ent638_1698
Symbol:
ID: 5112437
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Enterobacter sp. 638
Kingdom: Bacteria
Replicon accession: NC_009436
Strand:
Start bp: 1848358
End bp: 1849845
Gene Length: 1488 bp
Protein Length: 495 aa
Translation table: 11
GC content: 60%
IMG OID: 640491887
Product: succinylglutamic semialdehyde dehydrogenase
Protein accession: YP_001176428
Protein GI: 146311354
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR03240] succinylglutamic semialdehyde dehydrogenase
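
The coordinate and length fields above are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative only and do not come from the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1848358
END_BP = 1849845


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1488, matching "Gene Length"
print(protein_length(length_bp))  # 495, matching "Protein Length"
```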


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTTAT GGATCAATGG TGACTGGGTC ACGGGCGAAG GCGAACGACG CGTGAAGACC 
AATCCGGTCG GCAACGAGGC GCTCTGGCAG GGTTTTGATG CCAGTCCGGC TCAGGTCGAG
CAGGCGTGCC AGGCGGCCAG AAAAGCCTTT CCGGCGTGGG CAAAACTGCC GTTTACGGCC
CGTCAGGCGA TTGTCGAAAA ATTTGCGACG CTGCTCGAAG CCAATAAAGC AGAACTGACC
CGCGTTATCG CCCGCGAAAC CGGCAAGCCG CGCTGGGAAG CGACGACCGA AATTACAGCG
ATGATCAATA AAATCACCAT TTCGGTTAAG GCCTATCATA CCCGCACCGG CGAACAGCAT
ACCGCTATGG CGGACGGTGC GGCGACGCTG CGCCACCGTC CACACGGCGT GCTGGCAGTA
TTCGGGCCGT ACAATTTCCC GGGACATCTG CCTAACGGCC ATATTGTGCC TGCGCTGCTG
GCAGGGAATA CGGTGATTTT CAAACCGAGC GAGCTGACGC CGTGGAGCGG TGAAGCGGTT
GTGAAACTCT GGGAACAGGC GGGTCTGCCG CCGGGCGTGC TTAATCTGGT CCAGGGCGGG
CGTGAAACCG GTCAGGCGCT GAGCGCGTTA AGCGATCTCG ACGGCCTGCT GTTTACCGGC
AGTGCGGGAA CGGGATATCA GCTGCATCGT CAGTTGGCAG GTCAGCCGGA GAAAATTCTG
GCGCTGGAAA TGGGCGGCAA TAATCCCCTG ATTGTTGAAG ATCCTGAGGA TATTGACGCC
GCTGTGCATC TGGCGATCCA GTCGGCGTTT GTCACCGCCG GACAGCGCTG CACCTGCGCA
CGTCGTCTGC TGGTGAAAAA CGGCGCACAG GGCGATGCGT TTTTAGCGCG TCTCATTGAG
GTGACCGCGC GTCTGGTGCC TGATGCATGG GACGCCGAGC CGCAACCGTT TATCGGCGGG
CTGATTTCCG AACAGGCTGC GAATAACGTC ATTCATGCCT GGCGCGAACA CGTGGCGCGG
GGCGCAAAAA CGCTGCTGGA GCCAAAGCTT GTGCAGCCGG GAACGTCACT GTTAACGCCG
GGCATCATTG ATATGTCGGA TGCACGGGAC ATCCCGGATG AAGAGGTCTT TGGGCCGCTG
CTGTGCGTCT GGCGTTACGA TGATTTCGAC AGCGCAATCG CGATGGCCAA CAATACCCGC
TATGGCCTGT CGAGCGGGCT GATTTCACCC GATCGCGAGA AGTTTGATCA ACTGCTGATT
GAAGCACGTG CGGGCATCGT GAACTGGAAC AAACCGCTAA CTGGAGCGGC GAGCACCGCG
CCGTTTGGTG GTGTGGGTGC ATCCGGTAAT CATCGTGCCA GCGCGTGGTA TGCCGCCGAT
TACTGTGCGT GGCCGATGGC CAGTCTGGAA ACGCCCGCTC TGACGTTGCC GGAGGCGCTC
AACCCAGGAC TCGATTTTAC CCAGGGGAAT GGTCATGAAA GCGCGTGA
 
Protein sequence
MSLWINGDWV TGEGERRVKT NPVGNEALWQ GFDASPAQVE QACQAARKAF PAWAKLPFTA 
RQAIVEKFAT LLEANKAELT RVIARETGKP RWEATTEITA MINKITISVK AYHTRTGEQH
TAMADGAATL RHRPHGVLAV FGPYNFPGHL PNGHIVPALL AGNTVIFKPS ELTPWSGEAV
VKLWEQAGLP PGVLNLVQGG RETGQALSAL SDLDGLLFTG SAGTGYQLHR QLAGQPEKIL
ALEMGGNNPL IVEDPEDIDA AVHLAIQSAF VTAGQRCTCA RRLLVKNGAQ GDAFLARLIE
VTARLVPDAW DAEPQPFIGG LISEQAANNV IHAWREHVAR GAKTLLEPKL VQPGTSLLTP
GIIDMSDARD IPDEEVFGPL LCVWRYDDFD SAIAMANNTR YGLSSGLISP DREKFDQLLI
EARAGIVNWN KPLTGAASTA PFGGVGASGN HRASAWYAAD YCAWPMASLE TPALTLPEAL
NPGLDFTQGN GHESA
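
As a quick consistency check between the two sequences above, the following sketch strips the 10-character block spacing and translates the first three blocks of the gene sequence. It is a minimal illustration, not a full translator: the `CODONS` map contains only the codons that occur in this excerpt, and the helper names `clean` and `translate` are hypothetical.

```python
# Partial codon map: only the codons needed for this excerpt, not a full table.
CODONS = {
    "ATG": "M", "AGT": "S", "TTA": "L", "TGG": "W", "ATC": "I",
    "AAT": "N", "GGT": "G", "GAC": "D", "GTC": "V",
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in reading frame 1 using the partial map."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))


# First three blocks and the final block, copied from the gene sequence.
first_blocks = "ATGAGTTTAT GGATCAATGG TGACTGGGTC"
last_block = "GCGCGTGA"

print(translate(first_blocks))  # MSLWINGDWV, the first block of the protein
print(clean(last_block)[-3:])   # TGA, a stop codon in translation table 11
```

Extending this to the whole gene would need a complete codon table for translation table 11; a library such as Biopython provides one, but that is outside what this record itself specifies.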