Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A0818 |
Symbol | |
ID | 5135299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 828769 |
End bp | 831834 |
Gene Length | 3066 bp |
Protein Length | 1021 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640532276 |
Product | hypothetical protein |
Protein accession | YP_001216768 |
Protein GI | 147675638 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase [COG0277] FAD/FMN-containing dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.103467 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGATCAGC GTAGTAGGGG CAGGAAGCAA ATGTTACCAA GACTCCATCA CCAATCCGAC GTTGATCCGG TGGTTTTAAC TTTTTTACAC GAGTTAAAAA CCGCAGGATT TACCGGCGAT ATTGAAACCC AATACTCCAG CCGTCTTGCG GTAGCGACGG ATAACAGCGT CTACCAACAA TTGCCTCAAG CGGTCGTACA TCCTAAATCC ACCGCTGATG TTGTTCTTAT AGGAAAGATT AGTTCAAAAC CAGAATTTGA GAGAGTGACT TTTTCACCTC GTGGTGGCGG TACGGGCACG AATGGTCAAT CGCTGACTAA AGGGGTGGTC GTTGATTTGT CGCGGCATAT GAATCGTATC CTAGAAATTA ACCCGCAAGA GGGATGGGTA CGAGTGCAGG CGGGGGTCAT TAAAGATCAA CTCAATGATG CGGTACGCCC ACACGGTTTT TTCTTCTCTC CGGATCTTTC GACCAGTAAT CGCGCGACGC TTGGCGGCAT GGTTAACACC GATGCTTCAG GGCAAGGGTC GCTACAATAT GGCAAAACGT CGGATCACGT ACTCTCGCTG CAAGCGGTAT TTGCCGATGG TTCTTTGCTG GAAACCGATC TCTCGCAAGG TTTGCCTGCT CCCAATACGT TTGCTGCGCA AGCGATGCAA GTCACGGAGC AGGTTTGCCG CACCAAACGT AAGCAAATTG TCGCGAAATT TCCGCCGCTC AACCGCTTTT TGACTGGATA CGATTTAAAA AATGCCCTAA ATGAAGCCGA AGATCGCTTT GATATCACGC GGGTGTTGTG TGGTGCAGAA GGCTCACTGG CGTTTATTAC CGAAGCTAAA CTCAATTTAA CGCCTATCCC CAAAGCACGC ACGTTAGTGA ATGTGAAATA CGACAGCTTT GATTCTGCAC TGCGTAATGC GCCTTTAATG GTTGAAGCGA AAGCTTTGTC GGTAGAAACC GTTGACTCGA AAGTGCTCAA TCTTGCCAAA GAAGACATTA TTTGGCACAG CGTAAAAGAC TTGCTGACTG ATGTGCCTGG CAAAGAGATG CAAGGCATCA ATATGGTCGA ATACGCAGGC CAAGACAGCG CGCAGATTAA TCAGCAAGTC GCACAATTAA CCGCGCGCCT TGATGAGATG ATGGCCAACC AACAAGCGGG AATTATTGGC TATCAAGTCT GTAGTGATTT AGCGAGCATC AACCGCATTT ACAATATGCG TAAAAAAGCG GTGGGCTTGC TCGGTGCAGC CAAAGGCCGC GCTAAACCGG TTGCCTTTAC CGAAGATACC TGTGTGCCGC CTGAGAACTT AGCCGATTTT ATCGTTGAAT TTCGCGCGCT GCTCGATTCC AAAAATCTGG CGTATGGCAT GTTTGGGCAT GTGGATGCGG GCGTATTGCA TGTGCGTCCT GCGCTTGACT TGTGCGATCC CAAGCAAGAA TTGTTGATGC GTGAAATCTC CGATCAAGTG GTTAAGCTCG TCGCTAAATA TGGCGGTTTG ATGTGGGGGG AGCATGGCAA AGGCTATCGT TCGGAATATG GCCCTGAATT TTTTGGTGAG GAGCTGTTCA CTGAACTGCG TCGTGTGAAA GCGGCGTTTG ACCCGCACAA TAAGATGAAT CCGGGCAAGA TTTGTACCCC ACTTGATACA CCGTTTGAGC TGGTGAAAGT CTCCGACACT AAACGTGGTT TTTACGATCG CCAAATCGAC GTCAAAGTGC GCGACAGCTT TAAGCAAGCG ATGGAGTGTA ACGGTAACGG TTTGTGCTTT AACTACGAAA CTAGCTCCCC GATGTGTCCT TCAATGAAGG TGACGGCGGA TCGTCGCCAC TCGCCGAAAG GGCGCGCAGG ATTAGTGCGA GAATGGTTAC GTCAACTGAC GGAGCAGGGT ATTGATATCC TCGATCTTGA AAAAGCCACG CTCGAATCCT CGCCAACAAT TAAATCTATG CTAGATCGGG TTCGTCATGC TTTTAGCAAA GATAAAGAGT ACGACTTTTC GCATGAAGTG TACGAAGCGA TGAATGGTTG CTTGGCCTGT AAGGCCTGTG CAAGCCAGTG TCCGATTAAG GTGGATGTGC CCAGTTTCCG TTCGCGTTTT CTGAATATCT ATCACAGCCG TTATCCGCGC CCAGTCAAAG ATTATTTAGT CGCGAATATC GAAACCTTAT TACCTGTGAT GGCGAAGGCT CCGCAACTGG TGAATAGTGT GTTAGCACAA TCTTCGGTGC AAAAATTAAC CGCTAAAACG GTGGGTTATG TTGATGCGCC TCTCTTGTCG GTACCAACAC TCGCACAGCG TTTACGCCGT CATCCCGTCG TCCTGTTTGA TATGCAGCGT TTGGCGGGGT TATCGCAAGA AGAGCGCGAG CAACATGTGT TGATCGTGCA AGATCCGTTT ACCAGTTATT ACGATGCGGA TGTGGTCGAA GATTTTGTCG CCCTGCTGCT TAAATTAGGC AAAAAACCAG TATTGTTGCC GTTTAAACCC AATGGCAAAG CGCAACATAT TAAGGGCTTT CTACGCCAAT TTCGTTCGAC GGCAGCGAAT TCCGCGGCAT TTCTGACTCA AGTGGCGGAT CTCAATATTC CGCTCGTGGG TGTAGATCCT GCTTTGGTGC TTTGCTATCG CGATGAGTAT GTTGAAATAT TAGGCAAAGA GCGCGGTGAG TTTTCGGTTC TGACGGTTCA TGAATGGCTC AAACCACGCC TGTCGCAGTT TACGCCTCAA GTAACCGATG CACAGCCTTG GTACTTATTG GCGCACTGTA CGGAGAAAAC TAAGCTGCCG AATGCGGAAA AAGAGTGGGT GGAAATCTTC CGCCACTTTG GTACGCAGCT CAATGCGGTG GCGGTAGGGT GCTGTGGTAT GGCGGGTACC TTTGGTCATG AGGTGGATAA ATTAACCATG TCTCGCGATA TTTACGATTT GAGTTGGCAA CCCGCATTAG CGTCGCTACC GAAAGAGCGC TGTTTGGTCA CCGGTTACTC GTGTCGTAGC CAAGTGAAAC GGTTTGAGCA AATCAAACCT AAGCATCCGT TGCAGGCTCT GTTACACTTG CTATAA
|
Protein sequence | MDQRSRGRKQ MLPRLHHQSD VDPVVLTFLH ELKTAGFTGD IETQYSSRLA VATDNSVYQQ LPQAVVHPKS TADVVLIGKI SSKPEFERVT FSPRGGGTGT NGQSLTKGVV VDLSRHMNRI LEINPQEGWV RVQAGVIKDQ LNDAVRPHGF FFSPDLSTSN RATLGGMVNT DASGQGSLQY GKTSDHVLSL QAVFADGSLL ETDLSQGLPA PNTFAAQAMQ VTEQVCRTKR KQIVAKFPPL NRFLTGYDLK NALNEAEDRF DITRVLCGAE GSLAFITEAK LNLTPIPKAR TLVNVKYDSF DSALRNAPLM VEAKALSVET VDSKVLNLAK EDIIWHSVKD LLTDVPGKEM QGINMVEYAG QDSAQINQQV AQLTARLDEM MANQQAGIIG YQVCSDLASI NRIYNMRKKA VGLLGAAKGR AKPVAFTEDT CVPPENLADF IVEFRALLDS KNLAYGMFGH VDAGVLHVRP ALDLCDPKQE LLMREISDQV VKLVAKYGGL MWGEHGKGYR SEYGPEFFGE ELFTELRRVK AAFDPHNKMN PGKICTPLDT PFELVKVSDT KRGFYDRQID VKVRDSFKQA MECNGNGLCF NYETSSPMCP SMKVTADRRH SPKGRAGLVR EWLRQLTEQG IDILDLEKAT LESSPTIKSM LDRVRHAFSK DKEYDFSHEV YEAMNGCLAC KACASQCPIK VDVPSFRSRF LNIYHSRYPR PVKDYLVANI ETLLPVMAKA PQLVNSVLAQ SSVQKLTAKT VGYVDAPLLS VPTLAQRLRR HPVVLFDMQR LAGLSQEERE QHVLIVQDPF TSYYDADVVE DFVALLLKLG KKPVLLPFKP NGKAQHIKGF LRQFRSTAAN SAAFLTQVAD LNIPLVGVDP ALVLCYRDEY VEILGKERGE FSVLTVHEWL KPRLSQFTPQ VTDAQPWYLL AHCTEKTKLP NAEKEWVEIF RHFGTQLNAV AVGCCGMAGT FGHEVDKLTM SRDIYDLSWQ PALASLPKER CLVTGYSCRS QVKRFEQIKP KHPLQALLHL L
|
| |