Gene Acid345_1449 details

Gene Information

Locus tag: Acid345_1449
Symbol:
ID: 4071638
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1750443
End bp: 1751696
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 58%
IMG OID: 637983458
Product: phosphodiesterase I
Protein accession: YP_590525
Protein GI: 94968477
COG category: [R] General function prediction only
COG ID: [COG1524] Uncharacterized proteins of the AP superfamily
TIGRFAM ID:
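
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed values) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and do not come from the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1750443, 1751696)
print(length)                     # 1254
print(protein_length_aa(length))  # 417
```

Both results match the Gene Length and Protein Length fields of this record.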


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.982314
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGGC TTCTAGCTCG GCTGTGCTTC CTGCTGCTGA TCTCTGTTGG CGCTTTTGCG 
CAAAGCATTC CCGTCATCAC TGTCGATCAA CCGCCGAACT CAGCGGAACA GCAGAAGAAG
CACTATGTAG TTCTGGTGTC GCTGGATGGC TTTCGATATG ACTACGCGAA AAAGTATGGG
GCGACGCATC TCCTGGAAAT CGCCAAACAT GGGGCGAGCG TTCCCGACGG AATGATTCCG
TCGTATCCGT CTCTGACCTT TCCGAACCAC TACACCTTGG TGACCGGACT GTATCCCGAG
CACCACGGGA TTGTGGGCAA CGAGTTTTAC GATCCGGAGC GCAAAGCGAA GTACGCGTTG
AGTGACAAAG CGGCGGAAGC CGATGGTACG TGGTACGAGG GCACGCCGTT GTGGTCGCTG
GCGGTGAAGC AGGGAATGCG TAGTGCCTGT TTCTTCTGGC CGGGATCGGT GGCGGAGATC
GCAGGAGCGC GGCCCACTTA CTACCTGCGT TACGACAACT CCATCCCGGA CGAGAGGCGT
GTCGAGCAGG TCATCGCGTG GCTCAAGCTT CCTGCCGAGC AGCGTCCTCA CTTCATCACG
CTGTACTACT CGAAGGTCGA TCATGCCGGA CACGAATTCG GCCCGGACAG TCCGCAGGTT
GCGGAGGCAG TGAAGAGTGT GGATGCAACG ATTGGCCTGC TCGAGGAAAA CCTGCAGGCA
CTGCATCTGC CGATTGACCT GATCGTTGTT TCGGACCACG GCATGGCGAA GACGGACCCG
AATTGGGTCA CGCTCGACAA ATATGCTTCC CTCGACGGCT TCGTGACGGT TGGGCTGAGC
CTGTACGCAC CCTCGGAAGC TGCCGCCGAA AAGGCGTACG AGAAGCTGAA GGGCGGGGAT
TCGCGCTTCA ACGTCTACCG GCGGAAGGAC GTGCCGGCTG AGCTGCATTT CAACAGCAAT
CCGCGCGAAG GAGATCCGGT GGTGGTTGCA AAGGGTTCGT GGGCGATCCG CGCGACCACA
AACTCCTACG GCGGAGACAA GCCACCGAAT ATTGGCAATC ATGGGTTCGA TCCGCGTGTG
CTGCCGGAGA TGAAGGCGGT CTTTTACGCG GAGGGGCCCG ACATCAAACC CGGAGTTCAG
TTGCAGAGCT TCGAGAACGT AAACGTGTTC CCGTTGATTG TGGAACTGCT AGGTCTTGAT
AGCCCGAAGG TTGACGGTGA CCCCAAAGTT CTTTCGGGAA TTGTTAAGAA ATAG
 
Protein sequence
MKRLLARLCF LLLISVGAFA QSIPVITVDQ PPNSAEQQKK HYVVLVSLDG FRYDYAKKYG 
ATHLLEIAKH GASVPDGMIP SYPSLTFPNH YTLVTGLYPE HHGIVGNEFY DPERKAKYAL
SDKAAEADGT WYEGTPLWSL AVKQGMRSAC FFWPGSVAEI AGARPTYYLR YDNSIPDERR
VEQVIAWLKL PAEQRPHFIT LYYSKVDHAG HEFGPDSPQV AEAVKSVDAT IGLLEENLQA
LHLPIDLIVV SDHGMAKTDP NWVTLDKYAS LDGFVTVGLS LYAPSEAAAE KAYEKLKGGD
SRFNVYRRKD VPAELHFNSN PREGDPVVVA KGSWAIRATT NSYGGDKPPN IGNHGFDPRV
LPEMKAVFYA EGPDIKPGVQ LQSFENVNVF PLIVELLGLD SPKVDGDPKV LSGIVKK
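
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a hedged sketch, not the database's own pipeline: it builds the codon table from the standard 64-letter amino-acid string (translation table 11 assigns the same amino acids as the standard code and differs in its permitted start codons), ignores alternative start codons, and stops at the first stop codon. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a CDS up to (not including) the first stop codon.

    Whitespace is removed first, so the 10-base blocks used in this
    record can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence above.
print(translate("ATGAAACGGC TTCTAGCTCG GCTGTGCTTC"))  # MKRLLARLCF
```

The output matches the first block of the protein sequence. Passing the full gene sequence should reproduce the full protein sequence under the same assumptions.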