Gene Athe_2754 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2754 
Symbol 
ID7408324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2904090 
End bp2905733 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content36% 
IMG OID643717110 
ProductAlkaline phosphatase 
Protein accessionYP_002574579 
Protein GI222530697 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1785] Alkaline phosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAATG TTTTTTTGAG AAGAAAAAAT ATACTTATTT CAGTAATTTT AATCCTATGT 
CTAATTTTAT CAACTCTTTC ATTTGCTCAA CAAGATATAA ACGATAATGA AAATAGAGTC
AAAAATGTTA TTTTAATGAT TCCTGATGGT ATGACAATAG CCCACATAAC GCTTGCTCGC
TGGTATCAAG ATGGAAAACC TTTGGCAATG GACGAACTTG CCTGTGGGCT TGTAAGAACA
TATTCAGCAA ACAATCCAAT TACAGACTCA GCACCAGCTG CAACAGCTTA TGCTACAGGG
TATAAAACCC AAAACAGGTA CCTTTCCATA TATCCTGAGA TTGTTTCAAT GCCAGGAGTT
GGGCAAGTAG AGGAAAAAGA TTTTTATAAG CCTATAATAA CTATATTGGA AGCAGCTAAA
AATTTAGGAA AATCAACAGG TCTTGTTTTC ACATGCCAGT TTCCGCACGC AACACCTGCT
GCTTTTGCCT CACATACAGA CAACAGAAAT GATTATGAAT CTATAGCCGA GCAGATGGTT
TATAACCATG TTGATGTTGT GCTCGGTGGA GGGTACAAAT ACATTGACAA AAATCAGAGA
AAAGACAAGG AAGATTTGGC AGGTTACTTG AAGCAAAATG GCATTTTTGT TACAACAAAC
TGGCATGATG CAAAAAACTT TTTAGGAAAG AAAATCTGGG GACTTTTTGC CCAGGACGCA
ATGCAATATG ATTTTGACAG AAATGGCACA GGTGAACCGT CTTTAGCAGA GATGACTCAA
AAAGCACTCC AGATTCTGTC CAAGAATAGA AACGGATTTT TCTTAATGGT TGAAGGCAGC
CAAATTGACT GGGCATCACA TGCAAACGAC CCCGTTGGAG TTGTATCAGA GGTTTTGGCA
TTTGACAAGG CTGTCAAAGT AGCTTTAGAC TTTGCAAAGT CAAGAGATGA TACAGCTGTA
ATTATTGCAC CAGACCATAC CAATGGTGGT ATGACACTTG GAATAGGAAC AACTTCAATT
GACAGTATAC TATTGAATCA TTTTCTGAGG TACATCAGAG AAGCAACAAG AACGGCAGCA
GTAGCAGAAA AAATACTTGG AAACAATAGA ACAGATGAAA ATATCAAAAA GGTAGTATCA
CAGTATTATG GCATTGACAA TCTCACACAG GATGAAATAA ATGCAATCAA AAACGCTCCT
CAGGGAAGGT TAAACTATGT ATTAGGTCCA ATTATAAGCA AGCGATCATA CATTGGTTGG
ACTTCAAATG AACACACAGG TGAAGAGGTT GTTTTATATG CATATCACCC AAAAGATTAC
ATTCCAAGAG GTGTTATTGA AAACACAGAG GTCTGTGACT ACATGGCAGA AATTCTTGGT
ATTGACCTTG GAAGCTTTAA TGAAAATGCA TATATCTCAA ACATTGACTT AGAAGAAAAA
GGATATGATG TTTCCATTGA CACAAGTAAC CCGTCAAATA TTCAGCTTGT AATAAACAAA
GGAAGTAAAA CGTACATCAT CCCGCAAAAC AAAAATGTTG TTTTAGAAGG TTATAATCAG
TATAAGTTAA AATATGTAAG TGTGTACATT CCAAGTGCGA AAAGGTTCTT TGTATCAAGT
GAGATAGAAA GTTTGATTAA ATGA
 
Protein sequence
MKNVFLRRKN ILISVILILC LILSTLSFAQ QDINDNENRV KNVILMIPDG MTIAHITLAR 
WYQDGKPLAM DELACGLVRT YSANNPITDS APAATAYATG YKTQNRYLSI YPEIVSMPGV
GQVEEKDFYK PIITILEAAK NLGKSTGLVF TCQFPHATPA AFASHTDNRN DYESIAEQMV
YNHVDVVLGG GYKYIDKNQR KDKEDLAGYL KQNGIFVTTN WHDAKNFLGK KIWGLFAQDA
MQYDFDRNGT GEPSLAEMTQ KALQILSKNR NGFFLMVEGS QIDWASHAND PVGVVSEVLA
FDKAVKVALD FAKSRDDTAV IIAPDHTNGG MTLGIGTTSI DSILLNHFLR YIREATRTAA
VAEKILGNNR TDENIKKVVS QYYGIDNLTQ DEINAIKNAP QGRLNYVLGP IISKRSYIGW
TSNEHTGEEV VLYAYHPKDY IPRGVIENTE VCDYMAEILG IDLGSFNENA YISNIDLEEK
GYDVSIDTSN PSNIQLVINK GSKTYIIPQN KNVVLEGYNQ YKLKYVSVYI PSAKRFFVSS
EIESLIK