Gene VC0395_A0334 details


Gene Information

Locus tag: VC0395_A0334
Symbol:
ID: 5137320
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 354012
End bp: 355781
Gene Length: 1770 bp
Protein Length: 589 aa
Translation table: 11
GC content: 49%
IMG OID: 640531792
Product: hypothetical protein
Protein accession: YP_001216290
Protein GI: 147674081
COG category: [S] Function unknown
COG ID: [COG4715] Uncharacterized conserved protein
TIGRFAM ID:
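The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions (the record does not state this, but it matches the listed gene length) and that the final codon of the CDS is a stop codon that contributes no residue. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 354012
end_bp = 355781
protein_length_aa = 589

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the last one is the stop
# codon, so the protein has one residue fewer than there are codons.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print(gene_length_bp, total_codons, coding_codons)  # 1770 590 589
```

Under these assumptions the computed values agree with the Gene Length (1770 bp) and Protein Length (589 aa) fields.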


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAACCATC GGGATTCGAT AGGCAAAGTG CAATTCATCT TGAGCTATCT CTACTCTAGA 
AAGCCAAACA ACTTTGAAAT GGAACTGCCC ATGCCTAGCC ACTCCTCGTT TTTAGATATT
TCAGCGCTGT TTGAACAGTG TGAACAGAGT TCACTACTGA AAGGTGCACA ACTCGCGAAA
AGTGGCTCGG TACGCAAGCT CACCATCACT GGGCATACCG TCAACGCAGA AGTGAAAGGC
TCTCAACTTT ATCGCGTACA ACTTACTGGC GGTCAAGCTT CCTCTAGCCG CTGCACTTGC
CCTGCGGCGG GCCATCAAGC CGTATGCAAG CACGCCGTGG CTGTTGCCCT CTGCTTACTC
GATCAGCAAT CACAGAATGA AGACGGTGAA CGTGAGCAGA TCCGCCGTTA TCTAAGCACA
TTAAGCGAAG AGGCGAGGCT CGAGATGTTA CTTGATTATC TCGAGCGAGA TAAGTCGGTT
TGGCAAGCCT TACTGGCGAA AGAACAACCC AAAGATCAGA GCCTCAGTTA TAACGAGCTG
AAGAAAAAAA TTACCCAAGC GCTCCCTCGT AAACAGCTTT GGGATTGGGG GGACGTGAGC
GATTACTTTG CACAGGCCGA AGAGCAGCTC GACTGGGTAT TTGATGTTGC GATGCAACTT
GCAACCGATC AGCAGTGGAA ACTGATCCAG CACACAGTCA CTCGTTTGAA CAGCGTGTTA
GAGCAGATTG ATGACTCCAA CGGTGAACGT TTTGGAGTCG AAGCGCTGAT CAATGTTCAG
ATGCCGATGA TTTTGAACCG ATTGGAGTGG AGTGAGGAAG AGAAAGCGCA GTGGATGTTT
GAGCGCATGA CCCACCATGA GTTTGATGTG TATCCCTCAA TTGAAACCGA CTTCGCGATC
GTTTGGCGCA GCAACCCAAC ATTTTTACAC TTGTGTCGCA AAGCCATTGA GAACACACCA
CTAAACCGTG AAAACGCGTG GGATCTCAAA ACGTGGGCCG CACCACTACT GGCCATGGCA
GCGGATTGGC ATGAAGTTAT CGCCATTAAG CAAAAAATCG CGTGGCGCTG CGATGATTTT
CTGGACATAG TGCAGCTCTA CCTTGAGCAC CAAGAGCCAC ATCAAGCGGA ATTCTGGTTA
GCTAAAGCCA AAAAGCACGC GACCCCTTAT GAAAAGCAGC AGTGCGAGCA ACTGCAGGTC
AACCTCTACC TCTGTTTGGG GCAAATGACT CAAGCGTGGC AGTTGGCGAA CCGACTTTTC
ACTGAGAATC CCTCCTTTGA TAGTTACCAA AAGCTGAGTG CGTTCAAAAC CACACACCAA
ATTGAAGATG CGGAGTTTTT AGCGCGTATC GAACAGAGCT TGATCGCCTG CTATCAACCC
GCTGATGCAC GAGGATTTAT CAGCCGCAAC AGTAGTGATG TGGTCGAATT TTATATCGAG
CAGCAAGCGT GGAAAAAAGC CTGTGATTGG GTAGCCAATC GAGCCACCGA AAGTCAGGTA
CTACTCAGGT TGGCCGATCA CATTATTGCC AACCAACCTC AGATCTCGTT GCAGTACTAC
CTTCGCGTGG CGCGCGCTTC AATTGAACAA ACCAGCAACC AAGGTTATCA AAATGCGATT
CACCATCTAC AACGTATCGA GCGTTTATTA GCCAAAAATC CCACCGTGTT GGCCGATTTT
TATCTCGAAA TCACCGCTCT GGCAGAAGCC TACAAACGTA AGCGAAATAT GCATAACTTA
TTGAAAAAAC ACTATCCCAA GCAGGTTTAG
 
Protein sequence
MNHRDSIGKV QFILSYLYSR KPNNFEMELP MPSHSSFLDI SALFEQCEQS SLLKGAQLAK 
SGSVRKLTIT GHTVNAEVKG SQLYRVQLTG GQASSSRCTC PAAGHQAVCK HAVAVALCLL
DQQSQNEDGE REQIRRYLST LSEEARLEML LDYLERDKSV WQALLAKEQP KDQSLSYNEL
KKKITQALPR KQLWDWGDVS DYFAQAEEQL DWVFDVAMQL ATDQQWKLIQ HTVTRLNSVL
EQIDDSNGER FGVEALINVQ MPMILNRLEW SEEEKAQWMF ERMTHHEFDV YPSIETDFAI
VWRSNPTFLH LCRKAIENTP LNRENAWDLK TWAAPLLAMA ADWHEVIAIK QKIAWRCDDF
LDIVQLYLEH QEPHQAEFWL AKAKKHATPY EKQQCEQLQV NLYLCLGQMT QAWQLANRLF
TENPSFDSYQ KLSAFKTTHQ IEDAEFLARI EQSLIACYQP ADARGFISRN SSDVVEFYIE
QQAWKKACDW VANRATESQV LLRLADHIIA NQPQISLQYY LRVARASIEQ TSNQGYQNAI
HHLQRIERLL AKNPTVLADF YLEITALAEA YKRKRNMHNL LKKHYPKQV