Gene Paes_0101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_0101 
Symbol 
ID6458577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp98669 
End bp99727 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content56% 
IMG OID642724088 
Productphytase 
Protein accessionYP_002014808 
Protein GI194332948 
COG category[I] Lipid transport and metabolism 
COG ID[COG4247] 3-phytase (myo-inositol-hexaphosphate 3-phosphohydrolase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAACC ACACGCTGAA ATACACGCTG GCAACACTCA TCGCGGCGCT CTCGCTCCCC 
GGGTGCAACA CCGGGACCTC CCCCCACGAA GCCCGACCGC TGATCGTTAC CGAACAGGTC
CCGAACGACA GTGACGATCC TGCCATCTGG ATCAATCGCG AAGATCCGTC GAAAAGCCTT
GTCCTCGGTA CCGACAAAGA TGCAAACGGA GGCGTCTATG TGTTCGACCT CAAAGGCCGG
ATCCTGAAAG AAAAAACGGT AACAGGCCTT GCCCGCCCGA ACAACATCGA TATAGGTTAC
GGCCTGATGC TTGGGGGAAA ACCGGTCGAT ATCGCCGTTG TCACCGAACG GCTGACATCA
AAACTCCGGG TGTTCGCCCT TCCCGGCATG GAACCGATCG ACAATGGCGG CCTGCCGGTC
TTTGAAAACC AGAAGCTTGC CGCCCCGATG GGCATCGCAC TCTATAAACG ACCATCCGAC
AATGCCATGT TTGCCGTCGT CAGCAGAAAA CAGGGGCCTC AGGACGGAAC CTATCTCTGG
CAGTACCTTC TGGAGGATGA CGGCAGCGGC CAAGTCATAG CGACGAAGGT GCGCGAATTC
GGAGCATGGA GCGGAAAAAA GGAGATCGAA GCCGTAGCCG TCGACAACGA GGCAGGAAGA
ATCTATTACT CGGATGAGGG TTTCGGCATC AGATCGTACC GCGCCGATCC TGAACATCCG
GACGCCGGCG CTGAACTGGC TCTTTTCGCA ACTGAAGGGA TCACACGGGA CCACGAAGGA
ATCGCCATTG TCTCTGACAG CAACAACGGG GGTTGGATCA TCGTCTCGGA CCAGTCCGCA
GGAGAGCTCC ACCTCTACTC AAGAAACGGA GGCACTCCTG ATACAATGGA GCACCATACC
CTGAAGCGTG TCGTGAAAAC CGCAGCCATT GAGACCGACG GCATTGAAGC CGCACCGAAA
CTCAACGGAA CGGGCTTCCC GAAAGGTCTT TTCGTTGCCA TGTCTGACGA CAGGACATTC
CAGTACTATT CGCTGGAGGA TATCATCGGA ACACAGTAA
 
Protein sequence
MKNHTLKYTL ATLIAALSLP GCNTGTSPHE ARPLIVTEQV PNDSDDPAIW INREDPSKSL 
VLGTDKDANG GVYVFDLKGR ILKEKTVTGL ARPNNIDIGY GLMLGGKPVD IAVVTERLTS
KLRVFALPGM EPIDNGGLPV FENQKLAAPM GIALYKRPSD NAMFAVVSRK QGPQDGTYLW
QYLLEDDGSG QVIATKVREF GAWSGKKEIE AVAVDNEAGR IYYSDEGFGI RSYRADPEHP
DAGAELALFA TEGITRDHEG IAIVSDSNNG GWIIVSDQSA GELHLYSRNG GTPDTMEHHT
LKRVVKTAAI ETDGIEAAPK LNGTGFPKGL FVAMSDDRTF QYYSLEDIIG TQ