Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A1120 |
Symbol | |
ID | 5136944 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 1178659 |
End bp | 1181514 |
Gene Length | 2856 bp |
Protein Length | 951 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640532578 |
Product | putative formate dehydrogenase, alpha subunit |
Protein accession | YP_001217066 |
Protein GI | 147674130 |
COG category | [C] Energy production and conversion [R] General function prediction only |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGACTCA TCAAACGTTC AGACAGCGTG ACCAAAGAGC AAAATCAGCT CGGTATTAGC CGTCGTGCCT TTATGAAAAA CACGTCACTG GCAGCGGGCG GAGCGGCGGT TGGTGCCTCT TTGTTTACGC CGGGCATGAT CCGTAAAGCT CAGGCCAGCG ACGTGGATCG CAGCGCAAAA ACGGAAGTGA AGCGTACCAT CTGTTCGCAC TGTTCCGTCG GTTGCGGTAT CTACGCTGAA GTACAAAACG GTGTTTGGAC TGGCCAAGAG CCGGCTTTTG ATCACCCATT CAACGCGGGC GGTCACTGTG CAAAAGGCGC AGCGTTACGT GAACACGGCC ACGGTGAACG TCGCCTGAAA TACCCAATGA AGCTCGAAGG CGGCAAGTGG AAGAAGATCT CTTGGGAACA AGCCATCAAT GAAATTGGTG ACAAAGCGCT GAAGATCCGT GAAGAGTCAG GCCCAGATTC GGTTTATTTT CTCGGCAGTG CTAAGCACAG TAACGAGCAG GCCTATCTGT TCCGTAAAAT GGCTTCCCTG TGGGGCACCA ACAACGTTGA CCACCAAGCG CGTATTTGCC ACTCCACCAC GGTTGCGGGT GTAGCAAACA CTTGGGGTTA TGGTGCCATG ACCAACTCAT TCAATGACAT GCACAACTGT AAGTCGATGC TGTTCATTGG ATCTAACCCC GCCGAAGCTC ACCCAGTCGC GATGCAGCAC ATTTTGATCG CAAAAGAGAA AAACAGCTGC AAAATCGTGG TTGCCGATCC TCGTCGTACC CGTACTGCAG CAAAAGCGGA TTACTTTGTT TCCCTGCGCC CGGGTAGTGA CGTAGCCTTT ATTTGGGGCG TGCTGTGGCA CGTGTTCAAA AATAACTGGG AAGACAAAGA GTACATCCGT CAACGTGTCT TCGGTATGGA TGAAATCCGC GCGGAAGTGG CCAAATGGAC ACCAGCCGAA GTTGAGCGTG TCACTGGCGT AAGCGAAGAA GAAGTTTACA CCACAGCGAA AATCCTAGCG GAAAACCGTC CGGGTTGTGT GATTTGGTGT ATGGGCGGTA CGCAACACAC CACAGGTAAC AACAATACTC GTGCGTACTG CATCCTTGAG TTGGCGCTGG GCAACATCGG TAAATCAGGC GGCGGTGCCA ACATTTTCCG TGGTCACGAT AACGTGCAAG GCGCAACCGA CTTAGGTGTG CTTTCCGATA CATTGCCGGG TTACTACGGT TTGACCGAAG GTTCATGGAA ACACTGGGCA AGCGTATGGG GCGTGGATTT CGAGTGGATC AAAAACCGCT TTGACCAAGG CACTTATAAC GGCGCATTGC CAATGGAAAC TCCGGGGATC CCTGTCTCTC GTTGGATCGA TGGTGTACTT GAAAACAAAG ACAACCTTCA GCAACGTGAA AACATCCGCG CCATGTTCTA TTGGGGTCAT GCGGTGAACT CGCAAACCCG CGGCGTGGAA ATGAAAAAGG CGATGCAAAA GCTGGATATG ATGGTGATTG TTGACCCATA CCCAACGGTT GCGGCGGTAA TGAACGATCG CACCGATGGA GTGTATCTGC TTCCAGCGAC CACTCAGTTT GAAACATACG GCAGTGTGAC GGCGTCTAAC CGTTCTATTC AGTGGCGTGA TCAGGTGATT GAGCCGCTGT TTGAATCCAA ACCTGATCAC GAAATCATGT ATCTGCTCAG TCAAAAACTG GGGATTGCAG ATCAGTTGTG TAAACACATT CGGGTTGAGA ACAATCAGCC ACTGATTGAA GACATCACCC GTGAATTTAA CCGCGGTATG TGGACGATTG GTTACACCGG ACAAAGCCCA GAGCGCTTGA AAACACACCA ACAGAACTGG CACACCTTCC ACAAAACTAC GTTGGCGGCC GAAGGTGGCC CTGCGCATGG CGATACTTAC GGTATGCCTT GGCCATGTTG GGGAACGCCA GAGATGAAAC ACCCCGGCAC ACACATTCTT TACGATACCT CGAAAACCGT AGCCGAAGGT GGCGGTAACT TCCGTACCCG TTTTGGTGTG GAGTTTGAAG GTAAGAGTTT GCTGGCTGAA GATAGCTACT CGAAAGGGTG TGAGCTGCAA GACGGCTATC CAGAATTTAG CGATAAGCTG CTGAAACAAC TCGGATGGTG GGATGATTTA ACTGCGGAAG AGAAAGCGGC TGCAGAAGGT AAAAACTGGA AAACTGACCT TTCTGGCGGC ATTCAGCGTG TCGCGATCAA ACACGGCTGT ATTCCATTTG GTAACGCGAA AGCGCGGGCG ATTGTGTGGA CATTCCCAGA CCGCGTGCCG CTGCACCGTG AACCGCTGTA TACACCACGT CGTGATCTAC TCGCTGATTA CCCGACGTGG GACGATCAAG CGTTTATTTT CCGTGTGCCA ACCCTGTACA AATCGATTCA AGCGCAAGAT AAATCAGTGG AATACCCGAT CATTCTCACT TCAGGTCGCT TGGTCGAGTA TGAAGGCGGT GGTGAAGAAA CCCGCTCTAA CCCTTGGCTA GCCGAACTAC AACAAGAGAT GTTTGTTGAA GTGAACCCGA AAGATGCCAA CGATTTAGGC TTTATGGATG GTGATATGGT TTGGGTTGAA GGCGCAGAGA AAGGGCGCAT CAAAGTCAAA GCCATGGTGA CTCGTCGGGT GAAACCGGGC ATGGCGTTCT TACCATTCCA CTTTGGTGGC AAGTTCCAAG GGGAAGATCT GCGTCCAAAA TACCCAGAAG GGACACAGCC TTATGTGGTT GGGGAAGCGG CAAACACCGC CACAACCTAC GGCTACGATC CTGTCACCTT GATGCAAGAA ACCAAAGTCA CCCTCTGTAA CATTCGTAAA GCGTAA
|
Protein sequence | MRLIKRSDSV TKEQNQLGIS RRAFMKNTSL AAGGAAVGAS LFTPGMIRKA QASDVDRSAK TEVKRTICSH CSVGCGIYAE VQNGVWTGQE PAFDHPFNAG GHCAKGAALR EHGHGERRLK YPMKLEGGKW KKISWEQAIN EIGDKALKIR EESGPDSVYF LGSAKHSNEQ AYLFRKMASL WGTNNVDHQA RICHSTTVAG VANTWGYGAM TNSFNDMHNC KSMLFIGSNP AEAHPVAMQH ILIAKEKNSC KIVVADPRRT RTAAKADYFV SLRPGSDVAF IWGVLWHVFK NNWEDKEYIR QRVFGMDEIR AEVAKWTPAE VERVTGVSEE EVYTTAKILA ENRPGCVIWC MGGTQHTTGN NNTRAYCILE LALGNIGKSG GGANIFRGHD NVQGATDLGV LSDTLPGYYG LTEGSWKHWA SVWGVDFEWI KNRFDQGTYN GALPMETPGI PVSRWIDGVL ENKDNLQQRE NIRAMFYWGH AVNSQTRGVE MKKAMQKLDM MVIVDPYPTV AAVMNDRTDG VYLLPATTQF ETYGSVTASN RSIQWRDQVI EPLFESKPDH EIMYLLSQKL GIADQLCKHI RVENNQPLIE DITREFNRGM WTIGYTGQSP ERLKTHQQNW HTFHKTTLAA EGGPAHGDTY GMPWPCWGTP EMKHPGTHIL YDTSKTVAEG GGNFRTRFGV EFEGKSLLAE DSYSKGCELQ DGYPEFSDKL LKQLGWWDDL TAEEKAAAEG KNWKTDLSGG IQRVAIKHGC IPFGNAKARA IVWTFPDRVP LHREPLYTPR RDLLADYPTW DDQAFIFRVP TLYKSIQAQD KSVEYPIILT SGRLVEYEGG GEETRSNPWL AELQQEMFVE VNPKDANDLG FMDGDMVWVE GAEKGRIKVK AMVTRRVKPG MAFLPFHFGG KFQGEDLRPK YPEGTQPYVV GEAANTATTY GYDPVTLMQE TKVTLCNIRK A
|
| |