Gene Acid345_3950 details


Gene Information

Locus tag: Acid345_3950
Symbol:
ID: 4072422
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4672498
End bp: 4674567
Gene Length: 2070 bp
Protein Length: 689 aa
Translation table: 11
GC content: 58%
IMG OID: 637985976
Product: phosphodiesterase/alkaline phosphatase D
Protein accession: YP_593024
Protein GI: 94970976
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3540] Phosphodiesterase/alkaline phosphatase D
TIGRFAM ID:
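
The coordinates and lengths listed above agree with each other, which a little arithmetic confirms. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4672498, 4674567)
print(length)                  # 2070
print(protein_length(length))  # 689
```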


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.028055
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.284943
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTTTCA ATCGGCGCGA GTTTCTCGCC ATGGCTGCGG TGCTGGGTGC GAACGCAGCA 
TGGGGACGCG GATTAGGTTC ACCATCGAGC GTTGCGTGGC AGGAGCGGCG CGATTTCTAC
CCGGAAGGTG TCGCGTCGGG TGATCCTGAT AGCAATAGCG TTCTGCTGTG GACGCGGCAT
CTGCCCGGCA GTTCACCGGT TGAAGAACTG ACTGTCGAAG TAGCACTTGA TGAATCGTTC
CGTAAAGTGA TCGCCAAGCA CACCGCCAAG GTCTCACCAG CAGCCGACTG GACTTGTCGT
GTACTGGTAG GCGGTCTGAA CCCGGGACAC GTTTATTGGT ACAGATTTAC CGACCACAAC
GGCTTCGGCA GTCGAATCGG TCGAACCATG ACGGCGCCGA GCGACGACGA TCCGCGCCCG
GTGAGATTCG CGTTTGTGAG TTGCCAGAAC GCCAACCTTG GTGCTCAGAA CGCTTACCGC
CGCATGATTT TTGAAGACGA GCGAGCCGAC GAAGCCAACC GCCTCGGGTT TGTGCTTCAC
CTCGGTGACT TCATCTACGA ACTCGTTTGG TATCCGGAAG ACAAGGCCAC CTATTACGAT
CGCCAGGTCC AGGACATCGT TCGCTACCCG AAAGGTGAAA AGGTCAGCAA CTTTCATATT
CCTGTGGATG TAGATGACTA TCGCGCTGTC TACCGCAGCT ACCTGCACGA TCCCGATCTC
CAGGACGCGC GCGCGCGATT TCCGTTTGTT CCCATGTGGG ACAACCACGA GTTCAGTTGG
CAAGGATGGC AAAGCTTCGA GGTGTTCGAC GGAAAAACGA GACCGGCGCA GACGCGCAAG
GTAGCGGCCA TGCAGGCGTT CTTCGAGTAT CAGCCGGCGC GAATGGCGAA GCCCAGCGGT
CCTTCGCTCG AACAGTTCAA TCCGCCGACC GTCGCTGATG TCGCAGTGAC GAAATTCGAC
GACAACGGTC TCGGACAAGA ACCGAATAAC CTCGCAGCCA TCAACAGCCT TAAGGGCTAC
AGGTCGTTGC GCTGGGGGAA GCATGTCGAA TTGATCATCA CCGACGAACG AAGCTACCGT
TCGCCGGACC CCGGAACCGA CCTCGACGGC GATGCGCTGT TCAACAAGGA TTTCTCCGAG
TTCATTCCGG AAGAAGTCAT GGAGATCATT GACGCTGGTC GCGAATATAA CGGCGGCAAA
CCTCCGGACG CGATTCCCTT TGGCGATAAG AAGGTCGCGA ACACGCAAAA GAACCGCCCG
CCCCAGACCA TTCTCGGCGC AGAGCAGAAA AAGTGGTTTT TTGATCAGTT GCGAGAATCG
AAGGCGACCT GGAAGATTTG GGGAAGCACT ACGGCCACGC TTCCGCAGCG GGCAGATCCG
CAAAACCTGC CGCCAGGCAT TACCTCCAGT AAATGGCCCG GCGAAGCGTA TGCGAGCATG
GGCGGCGCCG ATATGAGCAC GGCCTCCCAC GAACTAGGCC AAATCTATGA CTTCGTCCGG
CAGCATGAAA TCACAGGCTT CGCTGCCGTC GCGGGCGACA GGCACAGCTT CTGGGCAGGG
TTAGCTTCGA AATCTCTTCC CCCAAAACCA TTCGATCCCG TCGGCGTGGC ATTTGTCGTG
GGTTCCATTT CGTCTCCCGG CGGTTTGGAA GCTTATGAAC ACAAAATGGC AAAAGACGAG
CCCTTGCGCT CACTCTTTAT AGGTCAAGCC CCCGCAGACA CCAAGCCGCA GCCGACCATC
AACATGCTCT TAATGCACGG CGTTCGCTCC TGCCTCGAAT ACTGCAAGAG CGGTGACCTC
GCGAAAGCAC GCGCACTCTC AAATCCCAGC ATGGCGCCGC AGATGTCGTT CGTCGACATC
GGCGGACACG GCTATGCGGT CGTCCAGGCC GCAGCGAATG AGCTCGAAAC AGAATTCGTT
TGCATTCCGC GCCCGGTGCG TCGCATTGAC AGCCCGGATG GAGGTCCAAT CCTCTACCGC
GCGCGTAACC ATAGCCGGCT GTGGCGAGCG GGCGAGCCGC CAAAGCTAGA GAGCACAGTG
GTGGAAGGAG AAGCACGATT TTCGGTTTGA
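
The gene sequence above is laid out in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. As a minimal sketch, the helpers below (hypothetical names) join the blocks and compute a GC percentage of the kind reported in the GC content field. The sample input is only the first two blocks of the sequence, so its result is not expected to match the 58% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence, used as a small sample.
sample = clean_sequence("ATGACTTTCA ATCGGCGCGA")
print(len(sample))         # 20
print(gc_percent(sample))  # 50.0
```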
 
Protein sequence
MTFNRREFLA MAAVLGANAA WGRGLGSPSS VAWQERRDFY PEGVASGDPD SNSVLLWTRH 
LPGSSPVEEL TVEVALDESF RKVIAKHTAK VSPAADWTCR VLVGGLNPGH VYWYRFTDHN
GFGSRIGRTM TAPSDDDPRP VRFAFVSCQN ANLGAQNAYR RMIFEDERAD EANRLGFVLH
LGDFIYELVW YPEDKATYYD RQVQDIVRYP KGEKVSNFHI PVDVDDYRAV YRSYLHDPDL
QDARARFPFV PMWDNHEFSW QGWQSFEVFD GKTRPAQTRK VAAMQAFFEY QPARMAKPSG
PSLEQFNPPT VADVAVTKFD DNGLGQEPNN LAAINSLKGY RSLRWGKHVE LIITDERSYR
SPDPGTDLDG DALFNKDFSE FIPEEVMEII DAGREYNGGK PPDAIPFGDK KVANTQKNRP
PQTILGAEQK KWFFDQLRES KATWKIWGST TATLPQRADP QNLPPGITSS KWPGEAYASM
GGADMSTASH ELGQIYDFVR QHEITGFAAV AGDRHSFWAG LASKSLPPKP FDPVGVAFVV
GSISSPGGLE AYEHKMAKDE PLRSLFIGQA PADTKPQPTI NMLLMHGVRS CLEYCKSGDL
AKARALSNPS MAPQMSFVDI GGHGYAVVQA AANELETEFV CIPRPVRRID SPDGGPILYR
ARNHSRLWRA GEPPKLESTV VEGEARFSV
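
The protein sequence corresponds to the gene sequence translated with translation table 11. As a rough sketch, the code below builds a codon table from the standard amino acid assignments, which table 11 shares; table 11 differs from the standard code only in which codons may act as start codons. The sketch ignores alternative start codons, so a GTG or TTG start would come out as V or L rather than M. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons in TCAG order paired with the standard amino acid assignments,
# which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGACTTTCA ATCGGCGCGA GTTTCTCGCC"))  # MTFNRREFLA
```

Passing the last three blocks of the gene sequence, `GTGGAAGGAG AAGCACGATT TTCGGTTTGA`, yields `VEGEARFSV`, the final nine residues of the protein, with translation ending at the TGA stop codon.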