Gene Acid345_1521 details

Gene Information

Locus tag: Acid345_1521
Symbol:
ID: 4073009
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1856768
End bp: 1857997
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 56%
IMG OID: 637983530
Product: phosphoesterase
Protein accession: YP_590597
Protein GI: 94968549
COG category:
COG ID:
TIGRFAM ID:
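
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the gene record; everything else is a hypothetical helper.
START_BP = 1856768
END_BP = 1857997
GENE_LENGTH_BP = 1230
PROTEIN_LENGTH_AA = 409


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)      # True
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions, 1857997 - 1856768 + 1 gives 1230 bp, and 1230 / 3 - 1 gives 409 aa, which agrees with the listed values.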


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.516758
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAGA TCATCGCAAG CCTTACGCTC GCCGCTACGG CGTTCGCGAA TACGTCATTG 
TTTGCCGCCG AAGGAAACGC CGCTCCCACG GGCGTTCTTC ATCTCGATCA CGTTTGGGTG
ATCCTGAGGG AAAACCATGG GTATGGCCAG CTTCGCAACA ATCCAAACGC TCCGTTTATC
AACAAGTACG CGGAGAGTGC GAATACGGCT GACAACTATT TTGCGGTTGC GCATCCCAGC
CTGACGAACT ACCTGGAAAT CGTGGGCGGA TCTAATTTCG GTGTTCTCGC CGACAATTTT
CCCGACTGGC ACAATCCCGC GTGCACGCCC AACATAGTCT CAGGAACGCA GAACAATGAG
CATTCCGGCG GAGCGCCGGT GTGTCCGATT TACGGCAACG GCAGCGACGC TCCGACACCG
GCAATTGATA AAACAAATGA GACGACCGGC GAGCCTGGCA CGATCAACAT CGACGGGAAG
ATGTCGATTC CCGCTGATAC GAGCACGCAT GGAAGGACGA TCGCAGACCA ACTGGTGGCG
CGCGGCATGA CGTGGAAGAG CTACCAGGAG AACCTGCCGG TTTTTGGCGC GGACTACGTC
AATTACAGTG ACGGCACGTT CACGAACAAT TCGATTTTGC CGGCGCCGTT GAAATCAAGC
GACATCGTCC AGTTGTATGC CGTGAAGCAC AATCCATTTG CGTACTTCCA GTCAGTACAG
GATGGGTACG ATCCGCAACT CAGTCTGAGT CAAGTGCGTG GTTTCGATGG GCAGGGCGGA
CTATACGAAG ATCTGGGCTC GGGAAAAGTT CCGAACTTCT CGCTGATTGC GCCGAACCAG
TGCAACGACC AGCACGGCCG CGGAAATGCA GGCCCCATCT GTAACTTCGA TCCGGCCGAC
AATGGCACCC AGACCGGACT GAACAACGCG CTCATCTACC AGGGCGACGT AACTGTCCAA
AGGCTCGTGA CTGCGATCCG GCGCTCGCCG GCATGGAACA GAGGCCGAAA CGCCATCGTT
GTGGTGTGGG ATGAGAATGA CTACTCGCTT GTGCCGATCA CGAACCAGGT TTTGTTCATT
GTTGACACCA ACTACGGCCG CCATGGCGTG CATAGTTCGC GGTTCTACGA TCACTTCTCG
CTGCTGAGAA GCCTGGAGGC TGGTTTCGGG CTCTCCTGCC TGAACCACGC GTGCGACGCA
CAATCGAAAG TGATGGGCGA TATCTTCTGA
 
Protein sequence
MKKIIASLTL AATAFANTSL FAAEGNAAPT GVLHLDHVWV ILRENHGYGQ LRNNPNAPFI 
NKYAESANTA DNYFAVAHPS LTNYLEIVGG SNFGVLADNF PDWHNPACTP NIVSGTQNNE
HSGGAPVCPI YGNGSDAPTP AIDKTNETTG EPGTINIDGK MSIPADTSTH GRTIADQLVA
RGMTWKSYQE NLPVFGADYV NYSDGTFTNN SILPAPLKSS DIVQLYAVKH NPFAYFQSVQ
DGYDPQLSLS QVRGFDGQGG LYEDLGSGKV PNFSLIAPNQ CNDQHGRGNA GPICNFDPAD
NGTQTGLNNA LIYQGDVTVQ RLVTAIRRSP AWNRGRNAIV VVWDENDYSL VPITNQVLFI
VDTNYGRHGV HSSRFYDHFS LLRSLEAGFG LSCLNHACDA QSKVMGDIF
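
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute GC content, and translate the nucleotide sequence so it can be compared with the listed protein. It assumes translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two tables differ in permitted start codons). The helper names `clean`, `gc_percent`, and `translate` are hypothetical, and only the first line of the gene sequence is embedded as an example.

```python
# Compact codon table: codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# First line of the gene sequence, copied as printed (with block spacing).
FIRST_LINE = "ATGAAGAAGA TCATCGCAAG CCTTACGCTC GCCGCTACGG CGTTCGCGAA TACGTCATTG"


def clean(seq: str) -> str:
    """Remove spaces and line breaks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    print(translate(FIRST_LINE))  # MKKIIASLTLAATAFANTSL
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the listed GC content.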