Gene Ent638_1750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_1750 
Symbol 
ID5113271 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp1902805 
End bp1905855 
Gene Length3051 bp 
Protein Length1016 aa 
Translation table11 
GC content57% 
IMG OID640491939 
Productformate dehydrogenase alpha subunit 
Protein accessionYP_001176480 
Protein GI146311406 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00182445 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAGTTTA GCCGGAGGCA GTTCTTTAAG ATCTGCGCGG GCGGTATGGC AGGAACAACT 
GTTGCATCTC TCGGATTTTT ATCATCTTTT TCCGCGATCG CGGAAACACG TCAATACAAA
CTCTTAAAAG CAAAAGAGAC GCGTAATAAC TGTACATACT GCTCAGTGGG CTGCGGCATG
ATCATGTATA GCCTCGGCGA CGATGCCAAA AACGTCAAAG AAAGTATCTA TCACGTCGAA
GGCGACTCGG ATCACCCGGT CAGCCGTGGG TCGCTTTGCC CGAAAGGGGC AGGGGTACTG
GATTACATTC ACAGTGATAC TCGCCTGCTG TATCCCGAAT ACCGCGCGCC GGGGTCTGAT
AAATGGCAGC GTATTTCATG GGATGACGCC ATTGAGCGTA TCGCCCGCCT GATGAAAGCG
GATCGTGATG CCAACTTTAT CGAAAAGAAC GCCCAGGGGC TGACGGTCAA CCGCTGGACG
ACCACGGGCA TGCTCTGTTC GTCGGCGGCC AGCAATGAAA CCGGTATTCT CGACGGAAAG
TTTGCCCGCG CGCTGGGCAT GGTGGCGATC GACTGTCAGG CGCGTTTGTG CCACGGTCCA
ACCGTTGCTG CGCTGGCACC GACCTTCGGG CGCGGGGCGA TGACCAATAA CTGGGTTGAT
ATCAAAAACG CCAACGTGGT GTTGATCATG GGCGGCAACG CGGCGGAAGC CCATCCGGTC
GGCTTTAAAT GGGTTGTTGA AGCGCAGACC AAAAACGACG CCACGGTGGT GGTGGTCGAT
CCGCGCTTTA ACCGCAGCGC GGCGGTGGCC GATCTGTATG CGCCGATTCG CGCCGGGTCT
GACACTGCGT TTCTGCTGGG GGCCATTCGC TATTTGATCG CGCATGACGC CATTCAGCAC
GAATACGTCC GCGCCTATAC TAACGCCAGC CTGATTATCC GCGACGACTA TGCGTTCGAT
GACGGTCTGT TCAGCGGCTA TGACGCCGAA AAGCGTCAGT ACGATAAATC GAGCTGGTTC
TATCAACTGG ACGAGCAGGG CCACGTGCAG CGTGACGATA CGCTCAGCCA CCCGCGCTGC
GTCTGGAATC TGCTTAAAGC GCACGTCGAT CGCTATACGC CGGAGATGGT GAACCGCTTG
TGCGGCACGT CGATTGATGA TTTCAACCGC ATTTGCGCGA TCCTTGCCAG CACCAGCGTA
CCGGACCGCA CCGCGACAAT CTTGTACGCA CTGGGCTGGA CGCACCATTC GGCGGGCGCA
CAGATCATCC GTGCGGCGGG AATGTTGCAG TTGCTGCTGG GCAATATCGG TATGGCAGGC
GGCGGCGTCA ACGCCCTTCG CGGTCACTCC AATATTCAGG GCTACACCGA TCTGGGGTTG
CTCTCAACCA ATCTGCCGGG CTACATGCCT CTGCCGTCCG AAAAACAGCC GGATTATCAG
ACCTATATCT CGCAGATCAC GCCGCCGTCG CTGGGGCTGA ACGAAGTGAA CTACTGGCAA
AACACGCCGA AGTTCTTTAT CAGCATGATG AAAAGCTTCT GGGGCGAGCA TGCCACGGCG
GACAACAACT GGGGCTACGA CTGGCTGCCG AAATGGGATC GTTTGTATGA CGTGATGACC
CAGGCCAAGC TGATGCTCGA CGGCAAAATC AATGGTTACA TCGTTCAGGG CTTTAACCCG
CTGGCGGCGT TCCCGGATAA AAACAAATCG TCCCGCGCGC TCTCGAAGCT CAAATACATG
GTGGTTATCG ATCCGCTGGT CACCGAGTCG TCGACGTTCT GGCAGCATCA CGGTGAGATG
AACGACGTGA ATCCGGCAGA TATTCAGACC GAAGTCTTCC GCCTGCCATC GTCCTGTTTT
GCGGAAGAAG ACGGCTCGAT TGCTAACTCT GGCCGCTGGC TGCAATGGCA CTGGGCGGCC
GCTGAGCCAC CGGGTGAAGC GATGCACGAT GGCAAAATCC TTGGCCGTCT GTTTACGCGC
CTGCGCGAAC TCTATCAGGC CGAAGGCGGG GCAAACCCGG CGCCGGTGCT GAACATGTCC
TGGGATTACA AAAATCCCCG CGATCCGCAT CCGGAAGAGA TTGCCCGCGA AGCCAACGGC
ATGGCGCTGG TGGATTTGTA TGATGACAAA GGCCAACTGG TGGCGAAAAA AGGCCAGCAG
CTCAGCAGTT TTGCGCAGTT ACGCGATGAC GGCACCACCA GCAGCTTCTG CTGGGTGTAC
TGCGGAAGCT GGACCGAGCA GGGCAATCAG ATGGCGAACC GCGATAACAG CGATCCTTAT
GGGCTGGGCT GTACGCCAGG CTGGGCGTGG TCGTGGCCGG CGAACCGTCG CATTCTGTAC
AACCGCGCCT CTGCCGATGT GGCCGGGAAA CCCTGGGACG CCAAACGTGC CTTGCTGCAC
TGGGATGGCA AAAAATGGAC CGGTCAGGAC GTGGCGGATT ACAACGCCTC GGCACCGGGT
AGCAACGTCG GGCCGTTTAT CATGAATCCA GAAGGGGTGG CGCGCCTGTT CTCCATCGAC
AAGATGAACG ACGGCCCGTT CCCGGAACAT TACGAACCGA TTGAATCACC GATTGGCACC
AACCCGCTGC ATCCGAATGT CATCTCCAGC CCCGTTGCGC GGATCTTCAA AGAAGACCTG
CCGAATATGG GCAAAGCGGA TGACTTCCCG TATGTCGCCA CGACCTATTC GATCACCGAG
CTGTTCCGTC ACTGGACTAA GCATGCGCGG CTCAACGCGA TTGCACAGCC GGATCAGTTT
GTCGAAATTG GCGAAGCGCT GGCGCAAGAG AAGGGCATTG TTGCCGGGGA TGAAGTGAAA
GTGATGTCGA AACGAGGCTT TATCAAAGCA AAAGCGGTGG TCACTAAACG CCTGCAAACC
CTGACCATTG ACGGTCGCAA GGTCAACACC GTGGGCATTC CGTGTCACTG GGGCTTTGAG
GGGGCAACGC GTAAAGGGTT CCTGGCCAAT ACGTTGACGC CATCCGTGGG CGACGCCAAC
TCGCAGACGC CGGAGTACAA GGCGTTTTTA GTCGACATCG AGAAGGCGTA A
 
Protein sequence
MEFSRRQFFK ICAGGMAGTT VASLGFLSSF SAIAETRQYK LLKAKETRNN CTYCSVGCGM 
IMYSLGDDAK NVKESIYHVE GDSDHPVSRG SLCPKGAGVL DYIHSDTRLL YPEYRAPGSD
KWQRISWDDA IERIARLMKA DRDANFIEKN AQGLTVNRWT TTGMLCSSAA SNETGILDGK
FARALGMVAI DCQARLCHGP TVAALAPTFG RGAMTNNWVD IKNANVVLIM GGNAAEAHPV
GFKWVVEAQT KNDATVVVVD PRFNRSAAVA DLYAPIRAGS DTAFLLGAIR YLIAHDAIQH
EYVRAYTNAS LIIRDDYAFD DGLFSGYDAE KRQYDKSSWF YQLDEQGHVQ RDDTLSHPRC
VWNLLKAHVD RYTPEMVNRL CGTSIDDFNR ICAILASTSV PDRTATILYA LGWTHHSAGA
QIIRAAGMLQ LLLGNIGMAG GGVNALRGHS NIQGYTDLGL LSTNLPGYMP LPSEKQPDYQ
TYISQITPPS LGLNEVNYWQ NTPKFFISMM KSFWGEHATA DNNWGYDWLP KWDRLYDVMT
QAKLMLDGKI NGYIVQGFNP LAAFPDKNKS SRALSKLKYM VVIDPLVTES STFWQHHGEM
NDVNPADIQT EVFRLPSSCF AEEDGSIANS GRWLQWHWAA AEPPGEAMHD GKILGRLFTR
LRELYQAEGG ANPAPVLNMS WDYKNPRDPH PEEIAREANG MALVDLYDDK GQLVAKKGQQ
LSSFAQLRDD GTTSSFCWVY CGSWTEQGNQ MANRDNSDPY GLGCTPGWAW SWPANRRILY
NRASADVAGK PWDAKRALLH WDGKKWTGQD VADYNASAPG SNVGPFIMNP EGVARLFSID
KMNDGPFPEH YEPIESPIGT NPLHPNVISS PVARIFKEDL PNMGKADDFP YVATTYSITE
LFRHWTKHAR LNAIAQPDQF VEIGEALAQE KGIVAGDEVK VMSKRGFIKA KAVVTKRLQT
LTIDGRKVNT VGIPCHWGFE GATRKGFLAN TLTPSVGDAN SQTPEYKAFL VDIEKA