Gene Clim_1122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1122 
Symbol 
ID6355764 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp1224512 
End bp1225570 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content49% 
IMG OID642668739 
Productphytase 
Protein accessionYP_001943170 
Protein GI189346641 
COG category[I] Lipid transport and metabolism 
COG ID[COG4247] 3-phytase (myo-inositol-hexaphosphate 3-phosphohydrolase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGATAG ATTTTATAAG GTGTATTGCC GTTGCGGTCT TTGTGGCGTT ATCAGCATGT 
TCGGGAATGA ACCGGGATCT GCCCGAAAAT GCCGTCAACC CGGTTGTCGT TTCCGAAAAA
GTGCCGCATG ATGCGGATGA TCCCGCTATC TGGGTTAACC ATGCCGATCA TGACGGGAGC
ATGATACTCG GGACTGACAA GCATGAGAAT GGGGCGGTTT ACGTTTTCGA TTTACAGGGA
CGCATTATCG CAAATAAATG CGTTCACGGG CTTCAGCGTC CAAACAATAT CGATGTCGAA
TACGGACTTC TTCTGAACGG TAAGCCGGTC GATATTGCTG TAGTAACAGA ACGTATGAGC
GGGAAGCTTC GGGTTTTTAC CCTGCCGGAT ATGAAGGCTG TCGATAAGGG GGGGATTCCC
GTATTTACCG GTGAACGGGA TAATGCGCCA ATGGGTGTTG CATTATATAA ACGGAAGCAT
GATGGAGCGA TCTATGCAGT TGTAAGCCGC AAACAGGGGC CGGTTGATGG AACGTATCTC
TGGCAGTACA GGCTTGAAGA TAGCGGAAAT GGTTTTGTGC GGGCCTCTCT TGTAAGAAAA
TTCGGCATAT GGAGTGGTAA AAAGGAGATC GAGGCTGTAG CGGTCGATGA CCGTTCAGGT
TTTGTCTATT ATTCCGATGA GGGGGTTGGT GTCAGAAAAT ATCATGCCGA TCCTGATATG
AAGGGAGGAG AGAAAGAGCT TGCCCTGTTT GCCACCGATG GTTTTACAAA AGATCATGAA
GGTATTGCGG TCTTTTCGAC GACAGATGGC AGCGTTGTCA TCATTTCAGA TCAGGGGGCC
GGTCAGCTTC ATCTGTTCAG GGAGTCCGGG TCTGCATCAG ATGGCAGTAA GGGAGTCCGA
CGGATAGGGA TTGTAAAAAC AGCGGCAGTT GACACAGACG GGATCGAGGC CTCTTCGGTA
CTTTCTACAG CCGGTTTTCC TGCCGGTATT CTTGTGGCTA TGTCGGATGA CCGTACGTTC
CAGTATTATT CGCTCAAAGA TCTGGGTATT CTACCCTGA
 
Protein sequence
MRIDFIRCIA VAVFVALSAC SGMNRDLPEN AVNPVVVSEK VPHDADDPAI WVNHADHDGS 
MILGTDKHEN GAVYVFDLQG RIIANKCVHG LQRPNNIDVE YGLLLNGKPV DIAVVTERMS
GKLRVFTLPD MKAVDKGGIP VFTGERDNAP MGVALYKRKH DGAIYAVVSR KQGPVDGTYL
WQYRLEDSGN GFVRASLVRK FGIWSGKKEI EAVAVDDRSG FVYYSDEGVG VRKYHADPDM
KGGEKELALF ATDGFTKDHE GIAVFSTTDG SVVIISDQGA GQLHLFRESG SASDGSKGVR
RIGIVKTAAV DTDGIEASSV LSTAGFPAGI LVAMSDDRTF QYYSLKDLGI LP