Gene VC0395_0454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_0454 
Symbol 
ID5134470 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009456 
Strand
Start bp504277 
End bp507318 
Gene Length3042 bp 
Protein Length1013 aa 
Translation table11 
GC content49% 
IMG OID640530777 
Productputative exonuclease SbcC 
Protein accessionYP_001215295 
Protein GI147671875 
COG category[L] Replication, recombination and repair 
COG ID[COG0419] ATPase involved in DNA repair 
TIGRFAM ID[TIGR00618] exonuclease SbcC 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTCCGT TAAAACTTAT CCTCCAAGCT TTCGGCCCTT TTGTTGGACG AGAAGAGATT 
GACTTTACGA AGTTGGGTGA TGCTCCACTG TTTTTGATCA ATGGGGCAAC GGGAGCCGGA
AAAAGCTCGA TTCTGGATGC GATTTGCTAC GCCTTGTATG GCGAAACCAC GGGCAGTGAG
CGTACTGGCG ACCAAATGCG CTGCGATTAT GCGGATCCTG AGTCTCTCAC CGAAGTCAGT
TTTGAGTTTG AGCTGGCCGG GGCGCGTTAT CAAATCACCC GTCAGCCCGA TCAAGAGATC
CCGAAAAAAC GGGGTGAAGG GATGACGAAG AAATCCCATT CCGCTACTTT GGTTGCACTG
AAAGCGGATG GAAACGAGCT GATTGCCAAC AAGCCCAATC CTGTCGCGAA GGCGGTGATC
GAATTGATGG GACTTGATGT TAAGCAATTT CGCCAAGTCA TGGTGTTGCC TCAAGGCAAG
TTTCGTGAGC TTTTAACCGC CAATTCAAAA GAGCGTGAGC AGATTTTTGG TCAGCTCTTT
CAAACCCAGC TCTACAGCCA AATTGAACGG GCGCTGTTTG AGCGCGCCGC GGGTATTCGT
AAAGAGAAAG AAGAGTTTGA TCAGCAGATC AAAGGTACGT TAAGTGTCGT CGGACTGGAA
AGTGAAGAGC AGTTACAAAC CGAGTTGACC GAACTGGCCC CAGTATTAAC CCATGCGCAA
TCACAACTCA AAGCTGAGCA ACAGCAGTGG GATGAAACAA AAGCGCACTA TCAAGCTGCG
CTTGAGTTAG AACAACAATT TATCCGCAAG CAGCAATTGG TGGTAGAAAT CGCCACTCAC
CAAGAGCAGG CTTCGCACAT CGAAATGCTG CGCCAGCAAC GCCAGCAAGC CCAAAAAGCA
GCGCGTTTAA CCGCCGTCCA TCAACAGTGG CACCAAGCTC AAAAAAACCT ACTGCAAGCT
AAGCTTAAGG TTGAGCAGCA GCAGACTCTG TTGCAACAGG CGAAAGCTCA GCAGCAACAG
GCTCAACAGG TCAGTCAGCA AGCCAGTTTA GCCTGTGAAG AAGTACCAAA ATTAAACGAG
CAACGCATCA CGTGGCAGCG TGCTGAGCAA AAATTGCTGG CACAAGAAAA TGTTCAGCAA
GCGGTGGCCA AGGCTGAGCG TGAACTGCAA CTGGCGACAC AAAATGCGCT TAATTTGCAG
CATGCGAGCG AAAAGCTAGA GCAAGAGCTA CAAAACCAAC GACTCGAATG GGAACAGCAA
CAGCGCCAAT TAACGCGCTT AGAAGTTCAA AAAGCGCGAA TGAATCAGTT GGTGCAGCAA
GTTCAGGCTC GCGAGCGAGA ACAATCCCTA CTCAATGAAT TACAAACTGC TCAACAAGCT
TTATTGCGCT TTGAGCAGCA ACATCGGCAC ATCCAAACTC AGGCCGAACA AGCGAAACTG
ACCGCGGATA AACTGGAGTT TGCATGGCAT ACCCAGAGGG CTGCGGAGCT TGCTCTTGCA
CTCACACAAA ATGAACCTTG TCCGGTATGC GGCAGTTTAG AGCATCCCAA TAAAGCGCAA
TATTCGGGTG ACGTTGTCAC CAAGGTTCAG GTTGAAAAAG CCAGACAGCA GCAACAAGAT
TGGGTGCAGC GTCAGCAAGA GGCATTCCAT GCTTGGCAGC AACAAGGGTT TAAAACCGAG
CAGATAGCGC AAAATCTCAC GACTTTATCG AGTGAGCTAA CTTTGCAGCA AGTGGCGTTA
TTGAACGAGC TCATTGAACA GCAACAAATA CTGCACAGTG ATATTGCTGC GCTACAACAG
CTTAATCCTG ATTTGCTGAA ACGGCAGATT GAAGAGGGGG AGCAGCGGTT AGCGCACACC
AAAATGACGC TCGAAAAACA GAATCAAAAC CAGCAACAAG CTTGGCAGAC TTTGGCTCAG
TTACAGGCGG AATTGGCAAG TTTGCGCCAA GAAATCCCGC CGGAGCTGTC CAATCTTGAT
ACTTTACGAA GCGCGATCGG GCGTGTGCAG AACCAAATAG AAATCTTACA AAAAGCGGAA
CATACGGCTC GTGAACAGTG GGTGCAGGCG CAAAAGCAGT TTGCCAGTGT GCAGGCCGCT
CATCAGGCGG CGATTGAAGC GCACCGTGAG TCTCAGCGTC AGCAGGAGGA AACCACAAGC
GCATGGCAGC AAGGGTTACT CCATTCTGGA TTTAACGATG AGTCCGCCTA TCTTGCCGCT
CGTTTAACCG ATGAGGCTAT CGGCAATATC GAGCGCCAAA TCGCCCAGTA TGAAGAGCGC
AGTGCGATGC TCAGTGGCGA ACAGCAAGCC TTATCACGTA AATTAGCAGA GAAAAATCGC
CCAGAGCTGG AACCACTTCT TGTCAAAGTA ACTCAAGCTG AAGAAAAAAT GGAACTGGCG
TTGCAGGCGT TTACGCAACA TCAATCGCGG ATGGATGGAT TGCAACGTGT CGCCAAGCAA
CTGGCGGATC TTTACCAGAA AAATCGTGCA TTAGAAGCCG AATATCAGGT CGTGGGTACC
TTAAGTGATA TTGCGAATGG CAAAACGGGC GCTAAAGTCA GCTTACATCG CTTTGTGCTT
GGTGTTTTGC TGGATGATGT CTTGTTACAA GCTTCTCAAC GACTGATGAA AATGAGCCGA
GGCCGCTATT TACTCAAACG TAAAGAGGAA CGCGCTAAAG GTAATGTAGG CTCAGGGCTG
GATTTGATGG TCGAAGATAG CTACAGCGGT AAATGGCGTG ATGTGGCAAC CTTGTCCGGT
GGTGAATCGT TCATGGCCGC CTTATCGCTT GCGCTTGGTT TATCCGATGT GGTTCAGGCT
TACAGTGGTG GTATCCGTCT TGATACTCTG TTTATTGATG AAGGTTTTGG TAGTTTGGAT
CCGGAATCTT TAGATTTAGC GATCCAAACC CTAATCGATC TTCAGCAAGG TGGTCGAACG
ATAGGGATCA TCTCTCATGT TACCGAGCTG AAAGAGCAGA TCGGTCTAAG ATTGGATGTG
TTGGCGACAA GAATGGGTAG CACGCTGCGT TTAATCACAT AA
 
Protein sequence
MRPLKLILQA FGPFVGREEI DFTKLGDAPL FLINGATGAG KSSILDAICY ALYGETTGSE 
RTGDQMRCDY ADPESLTEVS FEFELAGARY QITRQPDQEI PKKRGEGMTK KSHSATLVAL
KADGNELIAN KPNPVAKAVI ELMGLDVKQF RQVMVLPQGK FRELLTANSK EREQIFGQLF
QTQLYSQIER ALFERAAGIR KEKEEFDQQI KGTLSVVGLE SEEQLQTELT ELAPVLTHAQ
SQLKAEQQQW DETKAHYQAA LELEQQFIRK QQLVVEIATH QEQASHIEML RQQRQQAQKA
ARLTAVHQQW HQAQKNLLQA KLKVEQQQTL LQQAKAQQQQ AQQVSQQASL ACEEVPKLNE
QRITWQRAEQ KLLAQENVQQ AVAKAERELQ LATQNALNLQ HASEKLEQEL QNQRLEWEQQ
QRQLTRLEVQ KARMNQLVQQ VQAREREQSL LNELQTAQQA LLRFEQQHRH IQTQAEQAKL
TADKLEFAWH TQRAAELALA LTQNEPCPVC GSLEHPNKAQ YSGDVVTKVQ VEKARQQQQD
WVQRQQEAFH AWQQQGFKTE QIAQNLTTLS SELTLQQVAL LNELIEQQQI LHSDIAALQQ
LNPDLLKRQI EEGEQRLAHT KMTLEKQNQN QQQAWQTLAQ LQAELASLRQ EIPPELSNLD
TLRSAIGRVQ NQIEILQKAE HTAREQWVQA QKQFASVQAA HQAAIEAHRE SQRQQEETTS
AWQQGLLHSG FNDESAYLAA RLTDEAIGNI ERQIAQYEER SAMLSGEQQA LSRKLAEKNR
PELEPLLVKV TQAEEKMELA LQAFTQHQSR MDGLQRVAKQ LADLYQKNRA LEAEYQVVGT
LSDIANGKTG AKVSLHRFVL GVLLDDVLLQ ASQRLMKMSR GRYLLKRKEE RAKGNVGSGL
DLMVEDSYSG KWRDVATLSG GESFMAALSL ALGLSDVVQA YSGGIRLDTL FIDEGFGSLD
PESLDLAIQT LIDLQQGGRT IGIISHVTEL KEQIGLRLDV LATRMGSTLR LIT