Gene VC0395_0043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_0043 
Symbol 
ID5134347 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009456 
Strand
Start bp48303 
End bp49526 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content46% 
IMG OID640530366 
Producthypothetical protein 
Protein accessionYP_001214884 
Protein GI147672107 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4935] Regulatory P domain of the subtilisin-like proprotein convertases and other proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones53 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGG CGTCATTGGC ACTGGCCGTA AGTGTCGGAC TTTGCTCACC CGCAATTTGG 
GCTACGACCG AAATTGATGT ATTAGGCTTA TACACCCCAG ATACTGCGAA AGGGTTTAAG
CAAGAGCATG TCGCTCAAAT GCAGCACAGT GTGAACGTAG CGAACAAAGT CCTTAAAGAC
AGTGGTCTGG ATATCAAGGT TAATTTAGCC GCAACCAAAG AAGTGCAATA CGACACACAA
CCGGGATTGA AGAAATCTCA GAGTGAAGTG CTTGATGCTG CAACTCCATT TAATCGGATC
GACCCAGCTT TTGCGGATGT TGAAGCTTAC CGTCAACAAG TTGGGGCGGA TATGGTGGCT
ATTTTTCGTT ACCTTGATGT TAACAATTCG CCCGACTACG AACGTCAACC GAATGGTTCT
TATTCGATAA GCTGTGGTTT GGCTTGGATT GTGGCCCCTT CAGCGTGGAA ATATCCACAA
AATGCAAAGA AAAGCATGTA TAGCCACAGC TATTTAAATG AGTGTGGCGC GGAAACCTTT
ATCCATGAAT TAGGGCATAA CTTTGGTTTG AACCACGCTC ATGAGCAGTA CCGAGAGTTA
CCTCATCATA ATAATGGCAC AGAAGTCGAT GCTTATGGTT ATGGGATTAA AGGGCAATTT
GCCACCATCA TGGCATACCC GCACTTATTT GGCGTGGGGC GCAGTTATAA GTTCTCTAGC
CCCAATTTAC AGTGTGAAGG TGCTCCTTGT GGTGTGAAAG ATTATGCTAA CTCAGTACGT
GCGATAGGCC TCACGGCTCC ACATATTGCG CAAGTCTATA CGGGTACAAA GCCTCCGGTT
GATGATGGAA ATCCTGGGGA TACAGAACCG ACAGACAATA CGAATAACGT TTTTACGATA
AAAGGCCCGC TTGCACTACC GGATATGAAG ATCCTGACTT TGCCGATCGT AGTAAGTGCT
CAGCAATCCA GCACGGCGCA GGTTGCGATT GATATTACCC ATGAATACCG TGGCGATCTG
AGCATTAGGC TGTTTGCTCC AGATGGAAGC TATTGGGTAT TGAAGCAAGC AAACCGTTAT
GATCGTGGCC AAAGCTATAA CGTGCAATTC ACGCTAAATG ATGTTGATCC GTCTGCTGCT
GAAGGAGAGT GGCGTCTTGA GATCCAAGAT CACTTCGGTG GCAAATTGGG CACCTTGAAC
CAGTTCCAAA TCACCTTCCC GTAA
 
Protein sequence
MKKASLALAV SVGLCSPAIW ATTEIDVLGL YTPDTAKGFK QEHVAQMQHS VNVANKVLKD 
SGLDIKVNLA ATKEVQYDTQ PGLKKSQSEV LDAATPFNRI DPAFADVEAY RQQVGADMVA
IFRYLDVNNS PDYERQPNGS YSISCGLAWI VAPSAWKYPQ NAKKSMYSHS YLNECGAETF
IHELGHNFGL NHAHEQYREL PHHNNGTEVD AYGYGIKGQF ATIMAYPHLF GVGRSYKFSS
PNLQCEGAPC GVKDYANSVR AIGLTAPHIA QVYTGTKPPV DDGNPGDTEP TDNTNNVFTI
KGPLALPDMK ILTLPIVVSA QQSSTAQVAI DITHEYRGDL SIRLFAPDGS YWVLKQANRY
DRGQSYNVQF TLNDVDPSAA EGEWRLEIQD HFGGKLGTLN QFQITFP