Gene Acid345_4097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4097 
Symbol 
ID4072288 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4856689 
End bp4857879 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content57% 
IMG OID637986128 
Productphosphoesterase 
Protein accessionYP_593171 
Protein GI94971123 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.217924 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTTACGG ACAGACTCAA GCAGTTTTGC CTGGCGGTAG CGGTGTGCGT ATTGAGTTTC 
GCGGCGATTA GTTCAGCGCA AGTTCCTTCC TCCAACCATG TATTCGTTCT TATGGAGGAG
AACCACAGTT ACAGCCAGGT GGTCGGCAGT CCGAACATGC CGTACCTGAA TTCCCTGATC
CAGCAATATG GGCTGGCGAC GAATTACGAC GCGAACAGCC ACTACTCGCT GCCCAATTAT
TTCTGGATCA CCACCGGCAA GTACCTGACA CTGAACGACG GCACCACCGC AGTATTCAAC
GTGGACAACG TGGTGCGGCA AATGCTCTCG GCGGGCAAGA CCTGGAAGGC GTATGAGGAG
AGCATTCCTT CCGAGGGGTA CACGGGTGGA ACGGTTGAGC CGTACGAGAA GAACCACAAT
CCGTTTTCCT ATTTATCGGA CGTGGTGAAC AGCTCGCAGG CGAACAACAT GGTTCCGTTC
ACTCAGTTCG CAACCGACAT CGCGAATAAC GACCTCCCGA ACTTTGGATG GATCGGTCCG
AATGGCACGC ATAACGGACA CAACGCTGGT CTACCGACGA TGGACGCGTG GCTGAAGGCG
AACCTGCCGC AATTGCTAAG CAGTCCGGCA TTCCAGCCCG GCGGCGATGG ACTGTTAATC
ATTACGTTCG ACGAATCGGT GGACTCCGAC TGCGCGCCAT TGGCGAGCTG TCCGAAGCTT
CCGGAAAACG GCGGCGGCGG GCACGTTGCA ACGGTATTGA TTGGGCCGAA CGTGAAGAAG
GGATTCCAGT CGAGCACGTT CTACCAGCAT CCGAGCGTGC TGCGGACGGC GCTGATGGCG
CTTGGCATCA ACAGCGCTCC GGGCGCCGCG GCTACGGCAC CCCTCATGAC GGACTTCTTC
TCGGCGACGA CAACTGGCTG CACGGGCACC GGCACGAACC AGACGGTAAC GATCTGCTCC
CCTGCGAATG GAACGACGAT GGGAACCACA GCCGTAGAAG TGAGCGCGGT GGCGAACGAC
TCGAAGCCGG TGAGCTTCAC GCAGGTTTAC ATTGATGGCG TGAAGAAGTA CGAGGTGATG
GGCGGCAAGG TTGATACCAC TCTGACGCTT GCGTCTGGAA CGCGGCGCAT TACGGTGCAG
GCCTATGACG GAATTTATTT CAAGAGCACG ATCTATGTCA CGGTGCCGTA G
 
Protein sequence
MFTDRLKQFC LAVAVCVLSF AAISSAQVPS SNHVFVLMEE NHSYSQVVGS PNMPYLNSLI 
QQYGLATNYD ANSHYSLPNY FWITTGKYLT LNDGTTAVFN VDNVVRQMLS AGKTWKAYEE
SIPSEGYTGG TVEPYEKNHN PFSYLSDVVN SSQANNMVPF TQFATDIANN DLPNFGWIGP
NGTHNGHNAG LPTMDAWLKA NLPQLLSSPA FQPGGDGLLI ITFDESVDSD CAPLASCPKL
PENGGGGHVA TVLIGPNVKK GFQSSTFYQH PSVLRTALMA LGINSAPGAA ATAPLMTDFF
SATTTGCTGT GTNQTVTICS PANGTTMGTT AVEVSAVAND SKPVSFTQVY IDGVKKYEVM
GGKVDTTLTL ASGTRRITVQ AYDGIYFKST IYVTVP