Gene VC0395_A1486 details

Gene Information

Locus tag: VC0395_A1486
Symbol:
ID: 5136376
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 1595582
End bp: 1597000
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 48%
IMG OID: 640532944
Product: hypothetical protein
Protein accession: YP_001217429
Protein GI: 147674989
COG category: [S] Function unknown
COG ID: [COG3014] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
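
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming 1-based inclusive coordinates (which the listed values are consistent with) and a single terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1595582, 1597000)
print(length)                  # 1419
print(protein_length(length))  # 472
```

Both results agree with the Gene Length and Protein Length fields of the record.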


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.00338852
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGATGG TGACGAATCA CGTGGTTCAA TACATTAAAC GACCTATACG TAGCCTCACT 
CTTACTTCAC TACTGCTCGC GGGGTGTGCC AATCTCACTG CGGGCAATCT TTTTAGCCAT
TACAGTGCGC AAAACAGTGA TTTATATCAG TCTGTTGCGG CAGGGCAATA CCAACAAGCT
ACGCAGTTAT TGCCCGATTA TGTCGCCGGT GAAGTGTTAG ACAACCTTGA AAAAGGTCGG
GTTTATTTTC TTAGCCAGCA ATACGCAGAA AGCCAGCAAA GTTTGGCTGC TGCAGATTTG
GCGGTTAAAC AGCAGCAAGA CCGAGCGATC ATCTCTGTGA GTAGTACCGC GACCAGTGTG
GGTTCGCTGG CGGTCAACGA TAACTTGAAT GAGTACGAAC CTGCCGATTA TGAACTGGGC
TTTTTGCACC TGTACTTAGG GCTGAATTAT GTGCGCAATA ATGATCTGGA TGGTGCATTA
GTTGAAATGC GCCGCGCTAA CCAAGTTCAG GAAGCCGCTA AAAAACGCCG TGAACAAGCG
TTGTCCAGTG CGGAGCAAGA GATGCGCTCA CAAGGATTAT CCCCGAACTT AGGCAGTGTG
CTTGCACAAT ATCCAGATGC CGGAAAAACC TTACAAGCGG TGCAAAATGG TTATCTGTTG
TATCTTTCGG CGCTGCTTTA TGAAGCGGAT AATGATTTGA ACTCGGCTTA TGTTGACTAC
CGCAGAGCAC TGGCGGTTGC CCCCAATAAT CCATCGGTTA TTGATGGAAC ACTTCGAGTC
GCGTCTCGGC TTGGCATGCA ACAAGATCTC AAGCTTCTTA AACAGCAATA TGGTGAGCCT
AAACGCTTAA CCTCTGCGCA AGGCCGAGTC ATCGTCATAG AAGAGCAGGG TATAGTACAA
CCTAAGCAAG CGTGGCGACT CTCTTTACCT TTGTCTGACA GTCGTGGCAA TACGGCGCTG
TATTCTTTGG CTCTGCCTTA TTATTCATCT TCGGTGGCGC AGAGTCGTGT ACCGATCAGT
CTAAATGGCG CTAGTTTGGC CTCTATGCCA GTTACTGATG TGGATTTGAT GGCTAAGCAG
GCATTAACGG AGCAAATGCC TGCGCTGGTA CTGCGCCAAG CGCTGCGTGT GGTCGCCAAA
GACCAATTAC GCAAAGAGAC GACTCAGGAA GAGGATGTCG CCAATTTAGT GTTTAATATT
TGGAATACGT TAACGGAGCA GCCTGATACA CGCAGCTGGT TAACCTTGCC CGCCACAGTC
AATACGGCCA CCCAAGTGGT GAAAGCGGGT GATCAAGTGC TGGATATTGC AGGGACTCAC
TATACCTTTC ATGTTCCAGA AAATGGCACA GTGTTAGTTT GGCTATCTCA CCAAGGTAAC
AATGTGACAA TTTGGCATAA ACAACTAGGG ATACGGTAA
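
The gene sequence is displayed in blocks of 10 bases, so the whitespace has to be removed before any computation. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C among all bases. The record does not state how its GC content figure was computed or rounded, and the helper names `clean_sequence` and `gc_content` are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in seq (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First display line of the gene sequence, pasted as shown in the record.
first_line = "ATGTTGATGG TGACGAATCA CGTGGTTCAA TACATTAAAC GACCTATACG TAGCCTCACT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_content(seq), 1))
```

Passing the full gene sequence through `clean_sequence` first gives a string whose length can be compared against the Gene Length field before computing the GC percentage.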
 
Protein sequence
MLMVTNHVVQ YIKRPIRSLT LTSLLLAGCA NLTAGNLFSH YSAQNSDLYQ SVAAGQYQQA 
TQLLPDYVAG EVLDNLEKGR VYFLSQQYAE SQQSLAAADL AVKQQQDRAI ISVSSTATSV
GSLAVNDNLN EYEPADYELG FLHLYLGLNY VRNNDLDGAL VEMRRANQVQ EAAKKRREQA
LSSAEQEMRS QGLSPNLGSV LAQYPDAGKT LQAVQNGYLL YLSALLYEAD NDLNSAYVDY
RRALAVAPNN PSVIDGTLRV ASRLGMQQDL KLLKQQYGEP KRLTSAQGRV IVIEEQGIVQ
PKQAWRLSLP LSDSRGNTAL YSLALPYYSS SVAQSRVPIS LNGASLASMP VTDVDLMAKQ
ALTEQMPALV LRQALRVVAK DQLRKETTQE EDVANLVFNI WNTLTEQPDT RSWLTLPATV
NTATQVVKAG DQVLDIAGTH YTFHVPENGT VLVWLSHQGN NVTIWHKQLG IR
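
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming the standard codon assignments (translation table 11 shares them with the standard code and differs in which start codons are permitted). The function `translate` is hypothetical, translates the first codon as-is, stops at the first stop codon, and raises `KeyError` on any codon containing characters other than A, C, G, T.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, ignoring display whitespace, up to the first stop."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGTTGATGG TGACGAATCA CGTGGTTCAA"))  # MLMVTNHVVQ
print(translate("ATACGGTAA"))                         # IR
```

The two outputs match the first ten and last two residues of the protein sequence listed above.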