Gene Cphamn1_1536 details

Gene Information

Locus tag: Cphamn1_1536
Symbol:
ID: 6375214
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 1660434
End bp: 1661504
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 48%
IMG OID: 642684029
Product: phytase
Protein accession: YP_001959943
Protein GI: 189500473
COG category: [I] Lipid transport and metabolism
COG ID: [COG4247] 3-phytase (myo-inositol-hexaphosphate 3-phosphohydrolase)
TIGRFAM ID:
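
The coordinate and length fields above are related by simple arithmetic, which can be sanity-checked with a short sketch. It assumes the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; both assumptions are consistent with the values listed here but are not stated by the record itself. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon occupies one codon but encodes no amino acid.
    return codons - 1 if includes_stop else codons


length = gene_length(1660434, 1661504)
print(length)                  # 1071
print(protein_length(length))  # 356
```

Both results agree with the Gene Length and Protein Length fields.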


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.4705
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAG TTTTACCTAT CCTTGCATTG ATTGGAGTCG GCGGAATACT CGCATCCTGT 
CATAACAACA TCGAACAAAC GATCGAACCT TTGGCCGTCA CTGATTCCGT TCGTCATGAC
AGTGAAGATC CGGCCATATG GATAAACAAA GAAAATCCAT CAAAAAGCCT TGTCCTTGCA
ACAGACAAGC ATAAAGACGG TGCGTTGTAT GTTTTCGACC TTGAAGGCAA AGCAATACAC
AAAAAAACCA TCAAAGGCCT TGCAAGACCC AACAATGTTG ATGTGGGCTA CGGCTTCCCT
CTGAACGGGA ACAACGTCGA CATAGCGGTC GTCACAGAAA GGCTTGAAAA CCGGATACGT
ATATTCCGGT TACCCGATAT GACCGCTATC GACAATGGAG GCGTCCCGGT CTTTCAAGGG
GAAGAGTACA ACGCCCCGAT GGGTATAGCT TTTTACAAAA GACCTTCGGA CGGAAAAATG
TATGTTATTG TCAGCCGGAA ACAGGGGCCA ACCGACGGCA CATACCTCTG GCAGTATCTC
CTCGAAGACA GTGGTAATGG ATATATTACC GCTCATAAAG CAAGGGCGTT CGGACAATGG
AGCGGCCAAC AGGAAATCGA GGCTGTTGCA GTAGACAATG AGCTGGGTTA TGTCTATTAT
TCCGATGAAT GTGTGGGAGT CAGGAAATAT CATGCCGATC CTGAAACCCC GGATGCGAAC
CGGGAACTAT CCCTGTTTGC CACCAAAGGC TTTGCCGAAG ACCATGAAGG CGTCGCCATA
TGGAAAACCG GTGAAACCGA CGGCTATATT ATCGTTTCGG ATCAGGCAGC AGGAAAATTA
CGGCTTTATC CCAGAAACGG TAAAGACCTT CATGAACCTC ACAAGCACGA GCTGGTCGGT
ATTGTGCAAA CCGGCGCAAA AGAAACCGAC GGAATCGAAG CTGCTGCAGA GCTCGTAACA
GAGGAATACC CATCAGGCCT TCTGGTGGCC ATGTCCGACG ACAAAACCTA TCACTACTAC
TCCCTTAAGG ATATTCCCGA TAAACAAGAT AAGCCCCGTC AATCGCACTA A
 
Protein sequence
MKKVLPILAL IGVGGILASC HNNIEQTIEP LAVTDSVRHD SEDPAIWINK ENPSKSLVLA 
TDKHKDGALY VFDLEGKAIH KKTIKGLARP NNVDVGYGFP LNGNNVDIAV VTERLENRIR
IFRLPDMTAI DNGGVPVFQG EEYNAPMGIA FYKRPSDGKM YVIVSRKQGP TDGTYLWQYL
LEDSGNGYIT AHKARAFGQW SGQQEIEAVA VDNELGYVYY SDECVGVRKY HADPETPDAN
RELSLFATKG FAEDHEGVAI WKTGETDGYI IVSDQAAGKL RLYPRNGKDL HEPHKHELVG
IVQTGAKETD GIEAAAELVT EEYPSGLLVA MSDDKTYHYY SLKDIPDKQD KPRQSH
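
As a rough illustration of how the gene sequence relates to the protein sequence and the GC content field, the sketch below strips the 10-base block spacing, computes a GC fraction, and translates codons up to the first stop. It relies on the amino-acid assignments of translation table 11 being the same as in the standard genetic code. Alternative start codons are not handled, which does not matter here because this gene begins with ATG. The helper names are hypothetical, and only the first 30 bases of the gene are used as input.

```python
from itertools import product

# Standard codon assignments, enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
fragment = "ATGAAAAAAG TTTTACCTAT CCTTGCATTG"
print(translate(fragment))  # MKKVLPILAL
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein Length fields.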