Gene VC0395_A2018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A2018 
Symbol 
ID5136402 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp2171358 
End bp2172875 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content50% 
IMG OID640533475 
Producthypothetical protein 
Protein accessionYP_001217942 
Protein GI147675057 
COG category[S] Function unknown 
COG ID[COG3025] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.000169644 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAACCG AGATAGAACT GAAGTTTTTT GTTTCTCCAG ATTTTTCAAC CATCTTACGC 
GCGAAGATTT CTGAAACCAA AGTTCTTCAG CACAGCTGTC GGGAGTTAGG AAACACGTAC
TTTGATACCC CCGATAACTG GTTGCGTCAG CATGATATTG GACTGCGCAT CCGTCGTTTT
GATGAGGTGT ATATCCAGAC GGTGAAAACC GCAGGTCGTG TGGTCGCCGG TCTACATCAG
CGTCCGGAGT TCAATGCCGA ACATCACAGT AATGAGCCCG ATCTTTCGTT GCATCCTGCC
GATATCTGGC CGCAAGGAAA AGAGCTGACT CAGCTACAAG CCGAACTCAT GCCGCTCTTT
TCGACCAACT TCACTCGTGA ACAGTGGCTG ATTAGCATGG CTGATGGTAG CCAAGTGGAA
GTCGCCTTTG ATCAAGGTCT TGTGGTGGCG GGTGATCGCC AAGAGCCGAT TTGTGAAGTG
GAGCTGGAAC TTAAGTCTGG TCAAACCGAT GCACTGTTTA CCTTAGCGCG CCAGCTTTGT
GAGCATGGCG GTATGCGCTT AGGGAATCTT AGCAAAGCCG CTCGTGGCTA TCGCCTCGCT
GCCAATTATT CGGGCGATGA AATTCAACCC ATGGCTTTGG TCAGTGTCGA TAAAAATGAC
ACTGCCGAGT CTTGCTTTAT CCGTGCACTG GAACATGCGC TGGCGCACTG GCATTACCAT
GAGCAAATCT ATACCGAACG TGAAAACGTG GCGGCATTGC ACGAAATTCG TCATGCAGTG
AGTTACCTGC GCCAACTGCT TTCCGTCTAC GGCGGCATCA TTCCGCGCCG CGCCAGCGCG
ATTTTGCGCC AAGAGCTCAA ATGGTTAGAG CAAGAGTTAC AGTGGCTAAA AGAGTTTGAA
TATCTGGAAA GTTTGCAAGA AGACAAGGGC TACGCGCTGC GTAAACTGGA TGCGCGTAAG
TTTTTAGTCA CCGCGTTGAA AACCTTGCAA GAAAGCTTGC CACAGCGTGA AGATACACTG
CGTTTACTGA GCAGTGCCCG TTATACCGGC TTGTTACTGG ATCTAAGTCG TTGGGTGCTG
ACACGTGGTT GGCAGCCGTT TTTAGATGAC AAAGCACGAG AAAAAATGGC GCAGCCGCTG
GAAGCGTTCT CCGTAAAACA ACTTGACCGC ACATGGGCGG AGTTGATGGA AGCTTTTCCA
CCGGGTAAAA CCTTAACCGT ACAAGAATAC CTCGATCAGC AGTATCGCCT AATGCGTAAC
CTTTACACTG GGGTGAGTTT TGCAAGTTTA TACGATGCTG AAAACCGTCA GGCATTTCGG
ATGCCGTGGG CGGATTTGTT GCACGGTATT GATGATTTAT TGCGCCTCAA ACCATTAGAG
CGTTTGGTGG ATTTGCTACA AGGTGAAGAG CAAGATCAAC TGAAACGTTG GCTGATTCGT
CAAGAGAACT CCATTTTGCA TGCGATGGAG CAAAGCCGCA CGATGGGTGT TGAAGCGCAT
CCGTATTGGC GTGAGTAG
 
Protein sequence
METEIELKFF VSPDFSTILR AKISETKVLQ HSCRELGNTY FDTPDNWLRQ HDIGLRIRRF 
DEVYIQTVKT AGRVVAGLHQ RPEFNAEHHS NEPDLSLHPA DIWPQGKELT QLQAELMPLF
STNFTREQWL ISMADGSQVE VAFDQGLVVA GDRQEPICEV ELELKSGQTD ALFTLARQLC
EHGGMRLGNL SKAARGYRLA ANYSGDEIQP MALVSVDKND TAESCFIRAL EHALAHWHYH
EQIYTERENV AALHEIRHAV SYLRQLLSVY GGIIPRRASA ILRQELKWLE QELQWLKEFE
YLESLQEDKG YALRKLDARK FLVTALKTLQ ESLPQREDTL RLLSSARYTG LLLDLSRWVL
TRGWQPFLDD KAREKMAQPL EAFSVKQLDR TWAELMEAFP PGKTLTVQEY LDQQYRLMRN
LYTGVSFASL YDAENRQAFR MPWADLLHGI DDLLRLKPLE RLVDLLQGEE QDQLKRWLIR
QENSILHAME QSRTMGVEAH PYWRE