Gene VC0395_A1120 details


Gene Information

Locus tag: VC0395_A1120
Symbol:
ID: 5136944
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 1178659
End bp: 1181514
Gene Length: 2856 bp
Protein Length: 951 aa
Translation table: 11
GC content: 51%
IMG OID: 640532578
Product: putative formate dehydrogenase, alpha subunit
Protein accession: YP_001217066
Protein GI: 147674130
COG category: [C] Energy production and conversion
    [R] General function prediction only
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
    [COG3383] Uncharacterized anaerobic dehydrogenase
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
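
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon (neither convention is stated in the record). The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(1178659, 1181514)
print(length_bp, protein_length(length_bp))  # prints: 2856 951
```

Both results match the Gene Length and Protein Length fields.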


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGACTCA TCAAACGTTC AGACAGCGTG ACCAAAGAGC AAAATCAGCT CGGTATTAGC 
CGTCGTGCCT TTATGAAAAA CACGTCACTG GCAGCGGGCG GAGCGGCGGT TGGTGCCTCT
TTGTTTACGC CGGGCATGAT CCGTAAAGCT CAGGCCAGCG ACGTGGATCG CAGCGCAAAA
ACGGAAGTGA AGCGTACCAT CTGTTCGCAC TGTTCCGTCG GTTGCGGTAT CTACGCTGAA
GTACAAAACG GTGTTTGGAC TGGCCAAGAG CCGGCTTTTG ATCACCCATT CAACGCGGGC
GGTCACTGTG CAAAAGGCGC AGCGTTACGT GAACACGGCC ACGGTGAACG TCGCCTGAAA
TACCCAATGA AGCTCGAAGG CGGCAAGTGG AAGAAGATCT CTTGGGAACA AGCCATCAAT
GAAATTGGTG ACAAAGCGCT GAAGATCCGT GAAGAGTCAG GCCCAGATTC GGTTTATTTT
CTCGGCAGTG CTAAGCACAG TAACGAGCAG GCCTATCTGT TCCGTAAAAT GGCTTCCCTG
TGGGGCACCA ACAACGTTGA CCACCAAGCG CGTATTTGCC ACTCCACCAC GGTTGCGGGT
GTAGCAAACA CTTGGGGTTA TGGTGCCATG ACCAACTCAT TCAATGACAT GCACAACTGT
AAGTCGATGC TGTTCATTGG ATCTAACCCC GCCGAAGCTC ACCCAGTCGC GATGCAGCAC
ATTTTGATCG CAAAAGAGAA AAACAGCTGC AAAATCGTGG TTGCCGATCC TCGTCGTACC
CGTACTGCAG CAAAAGCGGA TTACTTTGTT TCCCTGCGCC CGGGTAGTGA CGTAGCCTTT
ATTTGGGGCG TGCTGTGGCA CGTGTTCAAA AATAACTGGG AAGACAAAGA GTACATCCGT
CAACGTGTCT TCGGTATGGA TGAAATCCGC GCGGAAGTGG CCAAATGGAC ACCAGCCGAA
GTTGAGCGTG TCACTGGCGT AAGCGAAGAA GAAGTTTACA CCACAGCGAA AATCCTAGCG
GAAAACCGTC CGGGTTGTGT GATTTGGTGT ATGGGCGGTA CGCAACACAC CACAGGTAAC
AACAATACTC GTGCGTACTG CATCCTTGAG TTGGCGCTGG GCAACATCGG TAAATCAGGC
GGCGGTGCCA ACATTTTCCG TGGTCACGAT AACGTGCAAG GCGCAACCGA CTTAGGTGTG
CTTTCCGATA CATTGCCGGG TTACTACGGT TTGACCGAAG GTTCATGGAA ACACTGGGCA
AGCGTATGGG GCGTGGATTT CGAGTGGATC AAAAACCGCT TTGACCAAGG CACTTATAAC
GGCGCATTGC CAATGGAAAC TCCGGGGATC CCTGTCTCTC GTTGGATCGA TGGTGTACTT
GAAAACAAAG ACAACCTTCA GCAACGTGAA AACATCCGCG CCATGTTCTA TTGGGGTCAT
GCGGTGAACT CGCAAACCCG CGGCGTGGAA ATGAAAAAGG CGATGCAAAA GCTGGATATG
ATGGTGATTG TTGACCCATA CCCAACGGTT GCGGCGGTAA TGAACGATCG CACCGATGGA
GTGTATCTGC TTCCAGCGAC CACTCAGTTT GAAACATACG GCAGTGTGAC GGCGTCTAAC
CGTTCTATTC AGTGGCGTGA TCAGGTGATT GAGCCGCTGT TTGAATCCAA ACCTGATCAC
GAAATCATGT ATCTGCTCAG TCAAAAACTG GGGATTGCAG ATCAGTTGTG TAAACACATT
CGGGTTGAGA ACAATCAGCC ACTGATTGAA GACATCACCC GTGAATTTAA CCGCGGTATG
TGGACGATTG GTTACACCGG ACAAAGCCCA GAGCGCTTGA AAACACACCA ACAGAACTGG
CACACCTTCC ACAAAACTAC GTTGGCGGCC GAAGGTGGCC CTGCGCATGG CGATACTTAC
GGTATGCCTT GGCCATGTTG GGGAACGCCA GAGATGAAAC ACCCCGGCAC ACACATTCTT
TACGATACCT CGAAAACCGT AGCCGAAGGT GGCGGTAACT TCCGTACCCG TTTTGGTGTG
GAGTTTGAAG GTAAGAGTTT GCTGGCTGAA GATAGCTACT CGAAAGGGTG TGAGCTGCAA
GACGGCTATC CAGAATTTAG CGATAAGCTG CTGAAACAAC TCGGATGGTG GGATGATTTA
ACTGCGGAAG AGAAAGCGGC TGCAGAAGGT AAAAACTGGA AAACTGACCT TTCTGGCGGC
ATTCAGCGTG TCGCGATCAA ACACGGCTGT ATTCCATTTG GTAACGCGAA AGCGCGGGCG
ATTGTGTGGA CATTCCCAGA CCGCGTGCCG CTGCACCGTG AACCGCTGTA TACACCACGT
CGTGATCTAC TCGCTGATTA CCCGACGTGG GACGATCAAG CGTTTATTTT CCGTGTGCCA
ACCCTGTACA AATCGATTCA AGCGCAAGAT AAATCAGTGG AATACCCGAT CATTCTCACT
TCAGGTCGCT TGGTCGAGTA TGAAGGCGGT GGTGAAGAAA CCCGCTCTAA CCCTTGGCTA
GCCGAACTAC AACAAGAGAT GTTTGTTGAA GTGAACCCGA AAGATGCCAA CGATTTAGGC
TTTATGGATG GTGATATGGT TTGGGTTGAA GGCGCAGAGA AAGGGCGCAT CAAAGTCAAA
GCCATGGTGA CTCGTCGGGT GAAACCGGGC ATGGCGTTCT TACCATTCCA CTTTGGTGGC
AAGTTCCAAG GGGAAGATCT GCGTCCAAAA TACCCAGAAG GGACACAGCC TTATGTGGTT
GGGGAAGCGG CAAACACCGC CACAACCTAC GGCTACGATC CTGTCACCTT GATGCAAGAA
ACCAAAGTCA CCCTCTGTAA CATTCGTAAA GCGTAA
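
The gene sequence above is printed in blocks of 10 bases, so the whitespace has to be stripped before any analysis. As a rough sketch, GC content could then be computed as shown below. The helper names are hypothetical, and the record does not state how the database derived its own GC content figure.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used as a small example input.
first_line = "ATGAGACTCA TCAAACGTTC AGACAGCGTG ACCAAAGAGC AAAATCAGCT CGGTATTAGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Applied to the full 2856 bp sequence, the result can be compared with the GC content field in the Gene Information section.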
 
Protein sequence
MRLIKRSDSV TKEQNQLGIS RRAFMKNTSL AAGGAAVGAS LFTPGMIRKA QASDVDRSAK 
TEVKRTICSH CSVGCGIYAE VQNGVWTGQE PAFDHPFNAG GHCAKGAALR EHGHGERRLK
YPMKLEGGKW KKISWEQAIN EIGDKALKIR EESGPDSVYF LGSAKHSNEQ AYLFRKMASL
WGTNNVDHQA RICHSTTVAG VANTWGYGAM TNSFNDMHNC KSMLFIGSNP AEAHPVAMQH
ILIAKEKNSC KIVVADPRRT RTAAKADYFV SLRPGSDVAF IWGVLWHVFK NNWEDKEYIR
QRVFGMDEIR AEVAKWTPAE VERVTGVSEE EVYTTAKILA ENRPGCVIWC MGGTQHTTGN
NNTRAYCILE LALGNIGKSG GGANIFRGHD NVQGATDLGV LSDTLPGYYG LTEGSWKHWA
SVWGVDFEWI KNRFDQGTYN GALPMETPGI PVSRWIDGVL ENKDNLQQRE NIRAMFYWGH
AVNSQTRGVE MKKAMQKLDM MVIVDPYPTV AAVMNDRTDG VYLLPATTQF ETYGSVTASN
RSIQWRDQVI EPLFESKPDH EIMYLLSQKL GIADQLCKHI RVENNQPLIE DITREFNRGM
WTIGYTGQSP ERLKTHQQNW HTFHKTTLAA EGGPAHGDTY GMPWPCWGTP EMKHPGTHIL
YDTSKTVAEG GGNFRTRFGV EFEGKSLLAE DSYSKGCELQ DGYPEFSDKL LKQLGWWDDL
TAEEKAAAEG KNWKTDLSGG IQRVAIKHGC IPFGNAKARA IVWTFPDRVP LHREPLYTPR
RDLLADYPTW DDQAFIFRVP TLYKSIQAQD KSVEYPIILT SGRLVEYEGG GEETRSNPWL
AELQQEMFVE VNPKDANDLG FMDGDMVWVE GAEKGRIKVK AMVTRRVKPG MAFLPFHFGG
KFQGEDLRPK YPEGTQPYVV GEAANTATTY GYDPVTLMQE TKVTLCNIRK A
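
The protein sequence corresponds to the gene sequence translated codon by codon. Below is a minimal Python sketch of that translation. The function name is hypothetical, and the codon table is written out compactly from the standard amino-acid assignments, which translation table 11 shares (table 11 differs in which codons may act as start codons, and this sketch does not model that).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, ordered with the third base varying fastest.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and the final line of the gene sequence above.
print(translate("ATGAGACTCA TCAAACGTTC AGACAGCGTG"))  # prints: MRLIKRSDSV
print(translate("ACCAAAGTCA CCCTCTGTAA CATTCGTAAA GCGTAA"))  # prints: TKVTLCNIRKA*
```

Both outputs agree with the start and end of the listed protein sequence, with the trailing `*` marking the TAA stop codon.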