Gene VC0395_1057 details


Gene Information

Locus tag: VC0395_1057
Symbol:
ID: 5134189
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009456
Strand:
Start bp: 1030342
End bp: 1032159
Gene Length: 1818 bp
Protein Length: 605 aa
Translation table: 11
GC content: 50%
IMG OID: 640531379
Product: terminase
Protein accession: YP_001215893
Protein GI: 147671611
COG category: [S] Function unknown
COG ID: [COG5484] Uncharacterized conserved protein
TIGRFAM ID:
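
The coordinates and lengths listed above are mutually consistent, which a little arithmetic confirms. The following is a minimal Python sketch, assuming inclusive 1-based coordinates and a CDS that ends in a single stop codon not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive 1-based start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon that is not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(1030342, 1032159)
print(length)                     # 1818
print(protein_length_aa(length))  # 605
```

Both values match the Gene Length and Protein Length fields of this record.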


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCATACT CTCCCGAAAT CCGACAAGCC GCCCGAGCCC TCTATTTGAA GGCATGGACG 
CCACGCGAAA TCGCCGACGA ATTGAATCTG AACAGTGACC GAATTATTTA CTACTGGGCG
GATAAGTTTG GTTGGCGCGA TATGTTGCGT GAACAAACGA TTGATGAAGC TATCGCGAAT
CGTATTCAAA CGCTGCTTGA GGTAGAGAAC CCAAGTAAAC CGCAGTTGGA TATGCTCGAT
CGGCTGATTA ATCATCACGT CAAACTTAAG AAGCTGCGCG CTACTGAGCA ACCGACTCAG
CCCAATGAAG CTGGTACAGT TTCGGCGCAA AGTGGTGCAC ATAGTAGCAA AAGTGGTTCA
CCTAAGGCCG AATCTGGCAC ACAAACGGGC GATTCTGGTA AACAATCTGC GCCCAGTGGT
AAACGTAAGA AAGTTAAGAA TGATGTTAGT GAGATCACCG AGGCCGATTT TAAGCTGTGG
CATGACTCGC TGTTTGCCTA TCAGCACACG ATGCGTAACA ACCTGCACCA GCGGACACGT
AACATTCTCA AGTCTCGCCA AATTGGCGCA ACCTATTACT TTGCAGGTGA AGCGTTAGAA
CAGGCGATTC TCACGGGCGA TAACCAGATA TTTCTCTCAG CCTCTCGCGC TCAAGCTGAT
GTTTTCCGTC GCTATATTGT GGCGATTGCA AAAGAGTTTT TAGGCATTGA GATCACGGGT
AACCCTTCTA CTTTGTCGAA TGGTGCAGAG TTGCACTATC TCTCTACCAA CGGCAAAACT
GCACAGAGTT ACCACGGCCA CGTTTATATT GATGAGTATT TCTGGATCGG CAAGTTTGAC
GAGCTGAACA AAGTCGCCTC GGCGATGGCT ACGCATAAAA AGTGGCGTAA GACTTACTTT
TCCACCCCTT CTTCTAAGAT GCACCCTGCT TACCCGTTCT GGACGGGTGA AAAATGGCGC
GGCGATAAAA CCACTCGGAA AAATATTGAG TTCCCGACCT TTGATGAACT GCGCGATGGC
GGTCGCTTGT GCCCTGATCG CCAGTGGCGT TATGTGGTTA CGATTGAGGA TGCCGCTAAG
GGTGGCTGTG ACCTCTTTGA TATTGAGGAA CTGCGCGAAG AGTACAGCGA GACGGACTTC
AACAACTTGT TTATGTGCGT GTTTGTTGAT GGTGCCAGCT CGATATTTGA ATTTAATAAG
ATTGAACGCT GCATGGTGGA TAGCGAGATT TGGCAGGACT TCAAGCCAAA CGCTGCTCGC
CCATTTGGTA GCCGTGAGGT GTGGTTAGGC TATGACCCAT CACGAACCCG TGATAATGCG
GTGCTGATGG TGGTCGCGCC ACCGATTGTG GCGGTTGAGA AATTCCGTGT GCTTGAGAAA
CACACTTGGC GCGGGCTTTC TTTCCAACAT CAGGCTTCTG AGATCAGCAA AGTGTTTGAG
CGCTTCAATG TGACTTACCT TGGCATTGAT ATCACCGGCA TTGGCGCGGG TGTTCATGAC
TTGCTGGTTA ATAAGCACCC TCGCGAAACG GTGGCAATTC ACTATTCGAA CGAAAATAAA
AACCGCTTGG TGATGAAGAT GATCGACATC ATTGACGGCA ACCGCCTGCA GTTTGATGCG
GGCATGAAAG AAACGGCAAT GGCGTTTATG GCGATTAAGC GTGTCGCCAC GAACAGCGGC
AACATGATGA CCTTTAAGGC CGAACGTAGC GAGCAAGCTG GCCACGCTGA CGACTTTTGG
GCTCTTTCTC ACGCGCTGAT TAATGAACCC CTCGATCACT CCACTCAACG CAAATCAACA
TGGCAGATGG CAGCATGA
 
Protein sequence
MAYSPEIRQA ARALYLKAWT PREIADELNL NSDRIIYYWA DKFGWRDMLR EQTIDEAIAN 
RIQTLLEVEN PSKPQLDMLD RLINHHVKLK KLRATEQPTQ PNEAGTVSAQ SGAHSSKSGS
PKAESGTQTG DSGKQSAPSG KRKKVKNDVS EITEADFKLW HDSLFAYQHT MRNNLHQRTR
NILKSRQIGA TYYFAGEALE QAILTGDNQI FLSASRAQAD VFRRYIVAIA KEFLGIEITG
NPSTLSNGAE LHYLSTNGKT AQSYHGHVYI DEYFWIGKFD ELNKVASAMA THKKWRKTYF
STPSSKMHPA YPFWTGEKWR GDKTTRKNIE FPTFDELRDG GRLCPDRQWR YVVTIEDAAK
GGCDLFDIEE LREEYSETDF NNLFMCVFVD GASSIFEFNK IERCMVDSEI WQDFKPNAAR
PFGSREVWLG YDPSRTRDNA VLMVVAPPIV AVEKFRVLEK HTWRGLSFQH QASEISKVFE
RFNVTYLGID ITGIGAGVHD LLVNKHPRET VAIHYSNENK NRLVMKMIDI IDGNRLQFDA
GMKETAMAFM AIKRVATNSG NMMTFKAERS EQAGHADDFW ALSHALINEP LDHSTQRKST
WQMAA
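
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a rough Python sketch of how the gene sequence could be cleaned, checked for GC content, and translated. The helper names are hypothetical. The codon table is written out by hand, on the assumption that translation table 11 assigns codons to amino acids the same way as the standard code and differs only in permitted start codons.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in an already-cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon or incomplete codon."""
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First three blocks of the gene sequence listed above
fragment = clean_sequence("ATGGCATACT CTCCCGAAAT CCGACAAGCC")
print(translate(fragment))  # MAYSPEIRQA
```

Applied to the full gene sequence, the same functions can be compared against the listed protein sequence and the GC content field.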