Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_0454 |
Symbol | |
ID | 5134470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009456 |
Strand | + |
Start bp | 504277 |
End bp | 507318 |
Gene Length | 3042 bp |
Protein Length | 1013 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640530777 |
Product | putative exonuclease SbcC |
Protein accession | YP_001215295 |
Protein GI | 147671875 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0419] ATPase involved in DNA repair |
TIGRFAM ID | [TIGR00618] exonuclease SbcC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCCGT TAAAACTTAT CCTCCAAGCT TTCGGCCCTT TTGTTGGACG AGAAGAGATT GACTTTACGA AGTTGGGTGA TGCTCCACTG TTTTTGATCA ATGGGGCAAC GGGAGCCGGA AAAAGCTCGA TTCTGGATGC GATTTGCTAC GCCTTGTATG GCGAAACCAC GGGCAGTGAG CGTACTGGCG ACCAAATGCG CTGCGATTAT GCGGATCCTG AGTCTCTCAC CGAAGTCAGT TTTGAGTTTG AGCTGGCCGG GGCGCGTTAT CAAATCACCC GTCAGCCCGA TCAAGAGATC CCGAAAAAAC GGGGTGAAGG GATGACGAAG AAATCCCATT CCGCTACTTT GGTTGCACTG AAAGCGGATG GAAACGAGCT GATTGCCAAC AAGCCCAATC CTGTCGCGAA GGCGGTGATC GAATTGATGG GACTTGATGT TAAGCAATTT CGCCAAGTCA TGGTGTTGCC TCAAGGCAAG TTTCGTGAGC TTTTAACCGC CAATTCAAAA GAGCGTGAGC AGATTTTTGG TCAGCTCTTT CAAACCCAGC TCTACAGCCA AATTGAACGG GCGCTGTTTG AGCGCGCCGC GGGTATTCGT AAAGAGAAAG AAGAGTTTGA TCAGCAGATC AAAGGTACGT TAAGTGTCGT CGGACTGGAA AGTGAAGAGC AGTTACAAAC CGAGTTGACC GAACTGGCCC CAGTATTAAC CCATGCGCAA TCACAACTCA AAGCTGAGCA ACAGCAGTGG GATGAAACAA AAGCGCACTA TCAAGCTGCG CTTGAGTTAG AACAACAATT TATCCGCAAG CAGCAATTGG TGGTAGAAAT CGCCACTCAC CAAGAGCAGG CTTCGCACAT CGAAATGCTG CGCCAGCAAC GCCAGCAAGC CCAAAAAGCA GCGCGTTTAA CCGCCGTCCA TCAACAGTGG CACCAAGCTC AAAAAAACCT ACTGCAAGCT AAGCTTAAGG TTGAGCAGCA GCAGACTCTG TTGCAACAGG CGAAAGCTCA GCAGCAACAG GCTCAACAGG TCAGTCAGCA AGCCAGTTTA GCCTGTGAAG AAGTACCAAA ATTAAACGAG CAACGCATCA CGTGGCAGCG TGCTGAGCAA AAATTGCTGG CACAAGAAAA TGTTCAGCAA GCGGTGGCCA AGGCTGAGCG TGAACTGCAA CTGGCGACAC AAAATGCGCT TAATTTGCAG CATGCGAGCG AAAAGCTAGA GCAAGAGCTA CAAAACCAAC GACTCGAATG GGAACAGCAA CAGCGCCAAT TAACGCGCTT AGAAGTTCAA AAAGCGCGAA TGAATCAGTT GGTGCAGCAA GTTCAGGCTC GCGAGCGAGA ACAATCCCTA CTCAATGAAT TACAAACTGC TCAACAAGCT TTATTGCGCT TTGAGCAGCA ACATCGGCAC ATCCAAACTC AGGCCGAACA AGCGAAACTG ACCGCGGATA AACTGGAGTT TGCATGGCAT ACCCAGAGGG CTGCGGAGCT TGCTCTTGCA CTCACACAAA ATGAACCTTG TCCGGTATGC GGCAGTTTAG AGCATCCCAA TAAAGCGCAA TATTCGGGTG ACGTTGTCAC CAAGGTTCAG GTTGAAAAAG CCAGACAGCA GCAACAAGAT TGGGTGCAGC GTCAGCAAGA GGCATTCCAT GCTTGGCAGC AACAAGGGTT TAAAACCGAG CAGATAGCGC AAAATCTCAC GACTTTATCG AGTGAGCTAA CTTTGCAGCA AGTGGCGTTA TTGAACGAGC TCATTGAACA GCAACAAATA CTGCACAGTG ATATTGCTGC GCTACAACAG CTTAATCCTG ATTTGCTGAA ACGGCAGATT GAAGAGGGGG AGCAGCGGTT AGCGCACACC AAAATGACGC TCGAAAAACA GAATCAAAAC CAGCAACAAG CTTGGCAGAC TTTGGCTCAG TTACAGGCGG AATTGGCAAG TTTGCGCCAA GAAATCCCGC CGGAGCTGTC CAATCTTGAT ACTTTACGAA GCGCGATCGG GCGTGTGCAG AACCAAATAG AAATCTTACA AAAAGCGGAA CATACGGCTC GTGAACAGTG GGTGCAGGCG CAAAAGCAGT TTGCCAGTGT GCAGGCCGCT CATCAGGCGG CGATTGAAGC GCACCGTGAG TCTCAGCGTC AGCAGGAGGA AACCACAAGC GCATGGCAGC AAGGGTTACT CCATTCTGGA TTTAACGATG AGTCCGCCTA TCTTGCCGCT CGTTTAACCG ATGAGGCTAT CGGCAATATC GAGCGCCAAA TCGCCCAGTA TGAAGAGCGC AGTGCGATGC TCAGTGGCGA ACAGCAAGCC TTATCACGTA AATTAGCAGA GAAAAATCGC CCAGAGCTGG AACCACTTCT TGTCAAAGTA ACTCAAGCTG AAGAAAAAAT GGAACTGGCG TTGCAGGCGT TTACGCAACA TCAATCGCGG ATGGATGGAT TGCAACGTGT CGCCAAGCAA CTGGCGGATC TTTACCAGAA AAATCGTGCA TTAGAAGCCG AATATCAGGT CGTGGGTACC TTAAGTGATA TTGCGAATGG CAAAACGGGC GCTAAAGTCA GCTTACATCG CTTTGTGCTT GGTGTTTTGC TGGATGATGT CTTGTTACAA GCTTCTCAAC GACTGATGAA AATGAGCCGA GGCCGCTATT TACTCAAACG TAAAGAGGAA CGCGCTAAAG GTAATGTAGG CTCAGGGCTG GATTTGATGG TCGAAGATAG CTACAGCGGT AAATGGCGTG ATGTGGCAAC CTTGTCCGGT GGTGAATCGT TCATGGCCGC CTTATCGCTT GCGCTTGGTT TATCCGATGT GGTTCAGGCT TACAGTGGTG GTATCCGTCT TGATACTCTG TTTATTGATG AAGGTTTTGG TAGTTTGGAT CCGGAATCTT TAGATTTAGC GATCCAAACC CTAATCGATC TTCAGCAAGG TGGTCGAACG ATAGGGATCA TCTCTCATGT TACCGAGCTG AAAGAGCAGA TCGGTCTAAG ATTGGATGTG TTGGCGACAA GAATGGGTAG CACGCTGCGT TTAATCACAT AA
|
Protein sequence | MRPLKLILQA FGPFVGREEI DFTKLGDAPL FLINGATGAG KSSILDAICY ALYGETTGSE RTGDQMRCDY ADPESLTEVS FEFELAGARY QITRQPDQEI PKKRGEGMTK KSHSATLVAL KADGNELIAN KPNPVAKAVI ELMGLDVKQF RQVMVLPQGK FRELLTANSK EREQIFGQLF QTQLYSQIER ALFERAAGIR KEKEEFDQQI KGTLSVVGLE SEEQLQTELT ELAPVLTHAQ SQLKAEQQQW DETKAHYQAA LELEQQFIRK QQLVVEIATH QEQASHIEML RQQRQQAQKA ARLTAVHQQW HQAQKNLLQA KLKVEQQQTL LQQAKAQQQQ AQQVSQQASL ACEEVPKLNE QRITWQRAEQ KLLAQENVQQ AVAKAERELQ LATQNALNLQ HASEKLEQEL QNQRLEWEQQ QRQLTRLEVQ KARMNQLVQQ VQAREREQSL LNELQTAQQA LLRFEQQHRH IQTQAEQAKL TADKLEFAWH TQRAAELALA LTQNEPCPVC GSLEHPNKAQ YSGDVVTKVQ VEKARQQQQD WVQRQQEAFH AWQQQGFKTE QIAQNLTTLS SELTLQQVAL LNELIEQQQI LHSDIAALQQ LNPDLLKRQI EEGEQRLAHT KMTLEKQNQN QQQAWQTLAQ LQAELASLRQ EIPPELSNLD TLRSAIGRVQ NQIEILQKAE HTAREQWVQA QKQFASVQAA HQAAIEAHRE SQRQQEETTS AWQQGLLHSG FNDESAYLAA RLTDEAIGNI ERQIAQYEER SAMLSGEQQA LSRKLAEKNR PELEPLLVKV TQAEEKMELA LQAFTQHQSR MDGLQRVAKQ LADLYQKNRA LEAEYQVVGT LSDIANGKTG AKVSLHRFVL GVLLDDVLLQ ASQRLMKMSR GRYLLKRKEE RAKGNVGSGL DLMVEDSYSG KWRDVATLSG GESFMAALSL ALGLSDVVQA YSGGIRLDTL FIDEGFGSLD PESLDLAIQT LIDLQQGGRT IGIISHVTEL KEQIGLRLDV LATRMGSTLR LIT
|
| |