Gene VC0395_A2472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A2472 
Symbol 
ID5135090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp2623253 
End bp2624428 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content50% 
IMG OID640533923 
Producthypothetical protein 
Protein accessionYP_001218365 
Protein GI147674460 
COG category[S] Function unknown 
COG ID[COG1652] Uncharacterized protein containing LysM domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATTTCTC CGAAACCCAG TCATTTTTTC GACCAAGGAA GCAAGGTCAT GGCACGATTA 
ACCCGTTATA TTACCTACGC TTTATTGCCT TTCACTCTGC TCAGTGGACA GGTGACCGCC
GATGAGCAAA CGCCGCTGGC ACTCAAACCA AATGCACCGA CCACCTATAC CGTGGTGAAA
GGTGACACCT TGTGGGATAT TTCCGCTTTG TATCTCGACA GTCCATGGCT GTGGCCAAGA
CTTTGGCAAG TGAACCCAGA AATTGATAAC CCACATTTGA TCTACCCCGG TGATAAATTG
ACGCTGTTTT GGCGTGATGG GCAACCCGTA CTCAGTCTAA AACCGATGCG CAAACTGAGC
CCGCAAGTGC GTGTGTTGGA GAAACAAGCT GTTCCGACTG TGCAAGAAGG CTTGGTGCTG
CCTTATTTAC AATCTGACCG TTTGATGGCG AAAACCGCCT TGCAAGGTAG CGTGCGTGTG
ATTGGCTCCA GTGAGGGGCG TCAATATCTC ACCAAGCAAG ATCAGCTTTA CATCTCCGGT
GTACACAGCG AGAAAAAGTG GGGGATTTAT CGCGAGGTAG CACAGTATCA ACGTGATGAT
GAAGTCATGG TGGCGCTGCG TTTAGTGGCG GTGGGTGAAT TGGCGATGAC TGGGGGCAAC
TTTAGTGGAC TAAGCTTGCT AGAGCAAAAT CAAGAGATTT TAGCCAATGA TATTGCTTTG
CCAGAAGTCG ATCTTGAGGA GCGCCAACTG TCGACAACCT TCTATCCGCA GCCAGCGCCT
GCAGGGAGTG AAGCTCGCAT TCTGGGTTCG CTTGAAGGGA GTCAATACGC CGGACAGAAT
CAAGTGGTGG TGATTGACCA AGGCCGCAGT GATGGCGTCG CGCAAGGCAG TATGTTTGAA
CTGTATCAAG CCGCAGTCCA AGTCAAAGCT AAGCAAGATT CGTCTACTTT CCTGTCTGAG
CGTTCGAACA CGGACGTACA GTTACCAAGT GTAAAAGTGG GCGCTTTGAT GGTGATTCGC
CCTTATGAGC GTTTTAGCTT GGCACTGATT ACTAACAGTT CGGCACCGAT CAGTGCTGAA
GTGCACGCAC TGTCTCCGCA GGCAGAGCCT CAAACCGAGC AGCCAGATGT AGTGCAATTG
TCTGAACAAA TCGATTTGGT TGATGATGCC AGTTAA
 
Protein sequence
MISPKPSHFF DQGSKVMARL TRYITYALLP FTLLSGQVTA DEQTPLALKP NAPTTYTVVK 
GDTLWDISAL YLDSPWLWPR LWQVNPEIDN PHLIYPGDKL TLFWRDGQPV LSLKPMRKLS
PQVRVLEKQA VPTVQEGLVL PYLQSDRLMA KTALQGSVRV IGSSEGRQYL TKQDQLYISG
VHSEKKWGIY REVAQYQRDD EVMVALRLVA VGELAMTGGN FSGLSLLEQN QEILANDIAL
PEVDLEERQL STTFYPQPAP AGSEARILGS LEGSQYAGQN QVVVIDQGRS DGVAQGSMFE
LYQAAVQVKA KQDSSTFLSE RSNTDVQLPS VKVGALMVIR PYERFSLALI TNSSAPISAE
VHALSPQAEP QTEQPDVVQL SEQIDLVDDA S