Gene Acid345_3848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3848 
Symbol 
ID4071000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4557038 
End bp4558147 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content58% 
IMG OID637985872 
Productphosphoesterase 
Protein accessionYP_592922 
Protein GI94970874 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3511] Phospholipase C 
TIGRFAM ID[TIGR03397] acid phosphatase, Burkholderia-type 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.352013 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGTGA GTTCAACACC CGGACCGGGA ACCGGCGGGA CCGCGGTTTC TCCGGTGAAG 
CGGGTGATTG TGCTGATCCT GCAGAACCAT TCGTTCGATT CGTTGTTTGC GACGTATCCG
GGCGTGATGG ACCCGTTGTC ATCAGGTTCG CCGGGATACA CGCAGGCGAG CGCGAGCGGT
GGCGGTACGG TCACGCCGTA CTTGCTAACT GATCCATTTC CCGCGGACAT GCCGCACGGC
GCGAAGTACT ACAATGCGAG CATCAACGGC GGAAAGATGG ATGGGTTCGC GGTCGCTGAG
CAGACCAACG TGTCGATGGG GCATTACGAC AGCACGATTC CCGGCGTGGA TACGATCTGG
AATTACGCCG GACAGTTCGC GCTAGCCGAC AACTTCTTTA TGCCGGATGT TGGAACGGAG
CCGAACCTCG CACTGATGAT GATCTCGGCG CAAGGTACGG GGAACGAATT CGGGGTACAG
CCGTCCTACG GACCGTGCAA CAAGACGGAC CCGGATGCGA AGGCGCTGAC GAACAAGAAT
GTCGGCGACG AAATGACTAC AGCCGGCGTG ACGTGGAGCT GGTTCCACGA GCAGTATGGC
GTTTGCGGCG ATTACGTGGC GACGGAGAAC CCGTTCCAGT ACTTCACGAG CACGCAAAAC
AGCGCGAATT TACAGGACAT TTCGCTCTTC TATTCGCAAC TGGACGGTGG GACGCTGCCG
TCAGTTTCGT TCGTGAATCC GGGCGGCGGA CATAACTGCC ATCCGGGAAA CAGTTCGATT
ACGACGTGCG CGGAGTATCT CGACAAGCTG GTGCAGCGAA TCCAGAAGTC GCCGGTGTGG
CCGGACTGCG CGGTGGTGGT GGTGTGGGAC GAGAGCGGCG GGTTCTACGA TCACGTGCCT
CCGCCAACGG TGGGCGGAAA CTTGGATGGG ATGCGGATAC CTATGATGGT GATCTCGCCG
TACGCGAAGA CTGGATACAT CTCGCATGTG CAGATGGACT TGGTTTCACT CTTGCGGTTT
ATCCAGTGGA ACTGGACGCT GCCGAACCTG AATTCGCGGA ACTCCGCGCC GGGTGCAACG
ATTGAGATGA AGGACATGTT TACGTTCTAG
 
Protein sequence
MAVSSTPGPG TGGTAVSPVK RVIVLILQNH SFDSLFATYP GVMDPLSSGS PGYTQASASG 
GGTVTPYLLT DPFPADMPHG AKYYNASING GKMDGFAVAE QTNVSMGHYD STIPGVDTIW
NYAGQFALAD NFFMPDVGTE PNLALMMISA QGTGNEFGVQ PSYGPCNKTD PDAKALTNKN
VGDEMTTAGV TWSWFHEQYG VCGDYVATEN PFQYFTSTQN SANLQDISLF YSQLDGGTLP
SVSFVNPGGG HNCHPGNSSI TTCAEYLDKL VQRIQKSPVW PDCAVVVVWD ESGGFYDHVP
PPTVGGNLDG MRIPMMVISP YAKTGYISHV QMDLVSLLRF IQWNWTLPNL NSRNSAPGAT
IEMKDMFTF