Gene VC0395_A0144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A0144 
Symbol 
ID5137259 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp140450 
End bp142174 
Gene Length1725 bp 
Protein Length574 aa 
Translation table11 
GC content50% 
IMG OID640531604 
Productendoglucanase-related protein 
Protein accessionYP_001216109 
Protein GI147673295 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones59 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGTTAC TGACCAACCA CATTGGCTAT GAAACCCAAG GCCCTAAACA GGCGGTGTTG 
CTGTGTGGAC AAACACAACT CATGGACGAT TGTGTGCTTT TGGTTTGCGC TCGTAGTCAC
CAAACCGTTG CCAAGCTCGC TATTGAGTGG CACGGCAAGG TCGACAACTG GCATCAGGGA
CAATTTCATC GTATCGATTT TTCCGATTTC ACGACGCCAG GCGACTATTA TCTGCGCCTG
GAACACACGC ATTCTGCCAC TTTTACTATT GCGCGAGGCG TATTAATGCA ACGCACATTT
TCTGATGTGC TGCACTACTT TAAATCGCAA CGCTGCTCTG GGCAGTTTGA TCAACAAGAC
AAGCAAGTAC CGCTGCTGAG CACATCAACC ACTGCCGATG TGCACGGTGG CTGGTACGAC
GCTTCAGGTG ACGTTAGCAA GTATCTCAGT CACCTTTCTT ACGCTAACTA TTTGAACCCA
CAACAAACAC CTTTGGTGGT CTGGAACATG CTCAAAGGGT TAGCGGTTTT ACAACATCAC
TCGGGTTTTG CATCGTTTTC TCGCACTCGC CTCAAGGATG AAGCGTTGTT TGGTGCTGAT
TTTCTACGCC GTATGCAAAA CTCCGAGGGA TTTTTTTATA TGACCGTCTT TGACAAATGG
AGCAAAGACA CCAAACAACG GGAGATTTGT GCCTACGCGA CCCAACAAGG CCATAAATCC
GATGATTATC AAGCGGGTTT TCGCCAAGGT GGGGGCATGG CGATTGCGGC ACTTGCCGCA
GCCGCGCGTT TGGATACGCA CGGCGAGTTC ACACAAGCTG ACTATTTACA AGCGGCAGAA
AATGGCTACT GGCACCTCAA AGAGCATAAC CTCGCCTACC TCAATGATGG GGTTGAAAAC
ATCATTGATG AGTACTGCGC ACTCTTGGCG TGTTGCGAAC TTTACCGCAC GACAGAGAAT
GACCAATATC TGGCTCAAGC TCGTGAGTGG GCACAGCGTT TAGCCAAGCG CCAATGCAGC
GATGAACAAA TTGCGCACTA CTGGTCTGCC ACCAGCAACG GTGAGCGCCC ATACTTCCAC
GCCAGTGATG CTGGTCTGCC TGTCATTGCA CTTTGCGAGT ATCTGAATAT TGAAACGGAT
ACGGCTAACT ACGCTCAACT CCAAAGAGTG GTCGAGCAAG CCTGTCAATT CGAGTTAGCG
ATAACTCAAC AAGTCTCCAA CCCGTTTGGG TACCCGCGTC AGTATGTCAA AGGGGTGGAA
AGCGCCAAAC GCACCAGTTT CTTTATCGCT CAAGACAACG AAAGTGGTTA CTGGTGGCAA
GGTGAAAATG CCCGTCTCGC CTCGTTGGCG AGCATGGCCT ATCTTGCTCA GCCTCATTTG
AGTACCGCTA TCGCTAAACC GCTTGAACAG TGGTCACAAA ATGCCCTGAA CTGGATTGTC
GGGCTCAATC CTTACAACAT GTGCATGCTC GATGGACATG GGCACAATAA TCCCGATTAC
TTACCTCATT TAGGCTTTTT CAATGCCAAA GGCGGTGTGT GTAACGGCAT AACCGCGGGC
TTTGATGACC CAAGAGATAT CGCGTTTAAC CCAGCAGGGC AAAAAGATGA CATGCTGCAA
AACTGGCGTT GGGGAGAACA ATGGATCCCG CATGGCGCTT GGTATCTGCT TGCCATCATC
AGCCAATTTG CTCACTTTAC CGCTCACGGG GAGGAGAACC AATGA
 
Protein sequence
MLLLTNHIGY ETQGPKQAVL LCGQTQLMDD CVLLVCARSH QTVAKLAIEW HGKVDNWHQG 
QFHRIDFSDF TTPGDYYLRL EHTHSATFTI ARGVLMQRTF SDVLHYFKSQ RCSGQFDQQD
KQVPLLSTST TADVHGGWYD ASGDVSKYLS HLSYANYLNP QQTPLVVWNM LKGLAVLQHH
SGFASFSRTR LKDEALFGAD FLRRMQNSEG FFYMTVFDKW SKDTKQREIC AYATQQGHKS
DDYQAGFRQG GGMAIAALAA AARLDTHGEF TQADYLQAAE NGYWHLKEHN LAYLNDGVEN
IIDEYCALLA CCELYRTTEN DQYLAQAREW AQRLAKRQCS DEQIAHYWSA TSNGERPYFH
ASDAGLPVIA LCEYLNIETD TANYAQLQRV VEQACQFELA ITQQVSNPFG YPRQYVKGVE
SAKRTSFFIA QDNESGYWWQ GENARLASLA SMAYLAQPHL STAIAKPLEQ WSQNALNWIV
GLNPYNMCML DGHGHNNPDY LPHLGFFNAK GGVCNGITAG FDDPRDIAFN PAGQKDDMLQ
NWRWGEQWIP HGAWYLLAII SQFAHFTAHG EENQ