Gene VC0395_0398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_0398 
Symbol 
ID5134894 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009456 
Strand
Start bp441983 
End bp443296 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content48% 
IMG OID640530721 
Producthypothetical protein 
Protein accessionYP_001215239 
Protein GI147671939 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value0.596861 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTTTT TCGAAAACAT CTCGATCATC ATGGCGCTGA TTGCTGCCAG TTGTTTCTTT 
TCTATGTCAG AAATTTCTTT GGCTGCGGCG CGCAAAATTC GTTTGCGGCA GATGGCCGAT
GAAGGCGATG AACGTGCCGA GCGGGTGTTA GAGCTGCAAG CTCGGCCAGG CAACTTTTTC
ACTGTGGTGC AAATTGGCCT CAATGCGGTT GCCATTATGG GCGGTATTGT GGGGGAATCG
GCGTTTACCC CTTACATCCG AGCGCTGCTG GAAGGATGGA TTCCAGCCAA TCTCCTGTCA
CAAGCGAGTT TTGTGCTCTC CTTTATGCTC GTAACCAGTA TGTTTATTTT GATTGCGGAT
TTGATGCCTA AGCGGATTGC GATGGCGATG CCAGAGCGAA TTGCCACCAG TTTGGTGGGA
GGCATGCTGA TTTGCATTAC TTTGTTAAAA CCGTTTGTTT GGTTCTTCAA TGGATTGGCG
AACTTGCTGT TTCGCGCTCT CAGTGTACCG ACCGAGCGTA ATGATGAGAT CACCTCTGAC
GACATTTATG CCGTGATGGA TGCTGGCGCA GAAGCGGGCG TGCTGGATAA AGGCGAGCAA
CAGATGATGG AAAGCGTGTT TGAAATGCAG AGCATTCCAG TGACATCGGC CATGACGGCG
CGCGAAAGTT TGGTGTTTCT TAACCTCAGC GACAGTGAGG AAGTGATCAA GCAGAAAATT
TCTCAGCATC CGCACAACAA ATTCTTGGTC TGTGATGGGC AGTTGGATCA GATCAAAGGT
TACGTTGACT CGAAGGCCTT GTTAATTCGA GTAATTAATG GTCAAGGAAT GAATCTCAAA
GAGAGCAATG TGGTCATTGG TTGTCCGATT ATTCCCGATA CGTTAAGCCT TTCGGAAGCG
TTGGAGTACT TCAAAATTAA CCGCGTTGAT TTTGCGGTGG TGATGAACGA ATACGCGCTT
GTTGTAGGCG TTGTGACGTT CAACGACTTA CAAAGCGCAG TCATGGGCAC TTGGGTGCTT
GCCGAAGGGG AAGAGCAAAT CGTCGCGCGT GATGGCAACT CATGGCTAGT AGACGGGGTG
ACACCGATCA CCGACGTGAT GCGCTCCTTT GCGATTGAAG AGTTTCCTCA GCAACAAAAC
TACGAAACGA TCGCAGGATT TATGATGTAT ATGCTGCGTA AGATCCCGCG TCGTACGGAT
TCAGTGGTCT ATGCCGGCTA TAAATTTGAA GTGGTGGACA TCGATAATTA CAAAGTCGAT
CAGCTTCTGG TGAGTCGCGT TGAACCTCTC GAACCTATCG TCAAAGAAGA ATAG
 
Protein sequence
MSFFENISII MALIAASCFF SMSEISLAAA RKIRLRQMAD EGDERAERVL ELQARPGNFF 
TVVQIGLNAV AIMGGIVGES AFTPYIRALL EGWIPANLLS QASFVLSFML VTSMFILIAD
LMPKRIAMAM PERIATSLVG GMLICITLLK PFVWFFNGLA NLLFRALSVP TERNDEITSD
DIYAVMDAGA EAGVLDKGEQ QMMESVFEMQ SIPVTSAMTA RESLVFLNLS DSEEVIKQKI
SQHPHNKFLV CDGQLDQIKG YVDSKALLIR VINGQGMNLK ESNVVIGCPI IPDTLSLSEA
LEYFKINRVD FAVVMNEYAL VVGVVTFNDL QSAVMGTWVL AEGEEQIVAR DGNSWLVDGV
TPITDVMRSF AIEEFPQQQN YETIAGFMMY MLRKIPRRTD SVVYAGYKFE VVDIDNYKVD
QLLVSRVEPL EPIVKEE