Gene VC0395_A2649 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A2649 
Symbol 
ID5136451 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp2802136 
End bp2803194 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content53% 
IMG OID640534097 
Producthypothetical protein 
Protein accessionYP_001218527 
Protein GI147674782 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTCTGC TTGCGATATA TATCTCTGTC GCGATTGGCG TATCGTTTAT CTGTTCAGTT 
TTAGAAGCGG TACTCTTAAG TATTACCCCA AGCTATCTAG CCCAGTTGCG CCAACAAGGC
AACCCTGCGG CTAATCGCCT AGCAGGCCTA AAGGCGGACA TCGACCGCCC ACTCGCCTCG
ATTCTGACCC TCAATACCAT TGCGCACACC ATAGGTGCTG CGACGGCTGG CGCACAAGCG
GCGGTGGTGT TTGGTAGCCA GTGGCTTGGC CTGTTCTCTG CTGTGCTCAC CCTAGGCATT
CTGGTGCTGT CGGAAATCGT GCCCAAAACC ATAGGTGCGA CCTACTGGCG TGAACTCGCT
CCCCAAGCTT CTCTCGTGCT GCGTTGGATG GTATGGGCGC TGACGCCGTT CGTCTGGTTC
TCAGAGCAGA TCACTAAGCG CCTCGCGCGC AAAGTTGAAG CGCCAAAGCT ACGTGACGAG
ATCTCCGCGA TGGCGATGTT GGCCAATGAA AACGGTGAGT TTGCAGAAGG CGAATCAAAA
ATGCTGAACA ACTTACTGGC GATTCAAAAT GTGCCAGTAA CGCAAGTTAT GACGCCGCGC
CCGGTACTGT TTCGCGTTTC CGCGGATCTA ACGATTGATG AATTTATCGA GCAGCACCGC
GATACGCCGT TCTCGCGCCC GCTGATTTAC AGCGAAGAGA AAGACAACAT TGTCGGCTTT
GTGCACCGCC TTGAGTTGTT TAAAGAGCAG CAAAATGGCC AAGGAAACTT GCTACTGGGT
GATGTGATGC GCCCAATCCA TGTGGTGCTC AACACCTTGA GCTTACCAAA AGCCTTCGAC
CAGATGATGC AAAAGCGCTT GCAACTGTCA GTCGTGGTAG ACGAATACGG CTCAGTGCAG
GGTTTGCTTA CCTTAGAAGA CATCTTCGAG CACTTGCTCG GCGAAGAGAT TATCGATGAA
GCCGACCGCA CAACCGATAT GCAGCAACTA GCCACCGAAC GCTGGGAGCA CTGGAAGCGC
CAGCATCGCA TGATCGAAAG CCGCGACGAA GTGGAATAA
 
Protein sequence
MFLLAIYISV AIGVSFICSV LEAVLLSITP SYLAQLRQQG NPAANRLAGL KADIDRPLAS 
ILTLNTIAHT IGAATAGAQA AVVFGSQWLG LFSAVLTLGI LVLSEIVPKT IGATYWRELA
PQASLVLRWM VWALTPFVWF SEQITKRLAR KVEAPKLRDE ISAMAMLANE NGEFAEGESK
MLNNLLAIQN VPVTQVMTPR PVLFRVSADL TIDEFIEQHR DTPFSRPLIY SEEKDNIVGF
VHRLELFKEQ QNGQGNLLLG DVMRPIHVVL NTLSLPKAFD QMMQKRLQLS VVVDEYGSVQ
GLLTLEDIFE HLLGEEIIDE ADRTTDMQQL ATERWEHWKR QHRMIESRDE VE