Gene VC0395_A1872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A1872 
Symbol 
ID5135918 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp1990408 
End bp1991496 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content49% 
IMG OID640533329 
Producthypothetical protein 
Protein accessionYP_001217796 
Protein GI147675459 
COG category[R] General function prediction only 
COG ID[COG3608] Predicted deacylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCCAG TAAAAGGACT AAAACTCGTG GCAAAGGCTA AAAAGCACGG CGACTTTTCA 
TTTTTGGGGG AAACCATTCC CCCCTCTTCT CGGCGAGTTA TCGAACTCGA AGCCGCCAAA
CTGTATACCG ACTCCCCTCT TTCTATCCCC ATCGAAGTTC TCCATGGCGC ATCTCCCGGC
CCCGTTTTGA TGATCAATGC CGCCATTCAT GGTGATGAAC TCAATGGCGT AGAAATCATC
CGCCAACTGC TCAATACACT TGATGAAAAA AAACTCAAAG GCACAGTGAT TGCGGTGCCT
ATCGTCAATG TTTTCGGTTT TATTCACAAA TCACGCTACT TACCCGATCG TCGTGATTTA
AATCGCTGCT TTCCCGGCAG TGAAAAAGGC TCTCTAGCCT CACGTATGGC GCACACCTTT
TTTTCCCAAG TCGCCGAGCG TTGTGATTAC ATCCTCGATC TGCATACCGG AGCGATTCAC
CGAACCAACC TACCGCAAAT TCGTGCCGAT CTGAGCAACA GCGAAACCCT GCGCATTGCA
CAAGCCTTTG CTACTCCAGT GATCATTGAT TCACCTTTAC GCGATGGCTC ACTGCGTAGT
GAAGCGGAAA AGCAGCAGAT CCCCGTCTTA ACTTATGAAG CGGGGGAAGC TCTGCGTTTT
GATCCGATTG CGATCAATGC TGGGATCATC GGTATCAAGC GAGTGATGCA GTCCATCGGC
ATGCTGCGCC CTAGCCGTAA AAAGATACCT AACTCGATCA TTGCAAAATC AACCAGTTGG
CTACGTGCCG AAGCCGACGG TATTTTACGT ACATTGGTGT CTTTAGGAGA TAAGGTCGAA
AAAGGCCAGG TGTTAGCTTA CATCAACTCC CCACTTGGCA AGCTAGAAGT GGAGATTCGA
GCCAACAAGA GCGGCATCGT GATTGGGCAA CAAACGCTCC CCTTAGTCAA TGAAGGGGAT
GCGGTGTTCC ATCTCGCCTA TTTTCATAAA GCTGATGACC TTATCGAACA AGTGGTCGAA
GAGTTTATCG AAGAGTTAAC GGAAGCCGAT TTGGAGCCCT TAACCACAGG ACATCTTGTC
ACTCTCTAA
 
Protein sequence
MQPVKGLKLV AKAKKHGDFS FLGETIPPSS RRVIELEAAK LYTDSPLSIP IEVLHGASPG 
PVLMINAAIH GDELNGVEII RQLLNTLDEK KLKGTVIAVP IVNVFGFIHK SRYLPDRRDL
NRCFPGSEKG SLASRMAHTF FSQVAERCDY ILDLHTGAIH RTNLPQIRAD LSNSETLRIA
QAFATPVIID SPLRDGSLRS EAEKQQIPVL TYEAGEALRF DPIAINAGII GIKRVMQSIG
MLRPSRKKIP NSIIAKSTSW LRAEADGILR TLVSLGDKVE KGQVLAYINS PLGKLEVEIR
ANKSGIVIGQ QTLPLVNEGD AVFHLAYFHK ADDLIEQVVE EFIEELTEAD LEPLTTGHLV
TL