Gene TM1040_3233 details

Gene Information

Locus tag: TM1040_3233
Symbol:
ID: 4075375
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 234200
End bp: 235180
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 60%
IMG OID: 638004742
Product: phosphonate-binding periplasmic protein
Protein accession: YP_611469
Protein GI: 99078211
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3221] ABC-type phosphate/phosphonate transport system, periplasmic component
TIGRFAM ID: [TIGR01098] phosphate/phosphite/phosphonate ABC transporters, periplasmic binding protein
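
The coordinate and length fields above are mutually consistent if the start and end positions are read as 1-based, inclusive coordinates (an assumption; the record does not state its convention). The following minimal sketch checks that arithmetic. The function names `gene_length` and `protein_length` are illustrative and do not come from the database.

```python
# Values copied from the record; helper names are hypothetical.
START_BP = 234200
END_BP = 235180


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 981 326
```

Both results match the reported Gene Length (981 bp) and Protein Length (326 aa).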


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.73019
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACCAT TCATCACCCG TGCGATTGGC GCACTGGGCA TTCTGACGTG CTCCACCATC 
GCCGCTCAGG CGCAAGACTG CGCGGAGCGT GGCGCGCTTG ATCTGCAGTA TTGCGACGTC
AACGGCGATC TCGTCGCCGA CACGCCGACC GATCCGGCAG AGCTGAGCGA TCCGGATACG
CTCGTTTTTG CCTACACCCC GGTCGAGGAT CCGGCGGTCT ATGCTGACAT TTGGGAGCCG
TTCATCGAGC ATCTCGCTGA CGTCACCGGC AAGGACGTAC AGTTCTTTGC AGTGCAGTCG
AACTCCGCCG AGGTTGAGGC GATGCGCTCT GGCCGTCTGC ATGTGGCAGG TTTTTCGACC
GGACCGACGC CTTTCGCCGT GAACCTTGCA GGCGCCGTGC CGTTTGCAAT CATGGGTGCC
GAGGATGGCC AGTTTGGCTA TAAGCTGCAG GTCTTCACCC AGGCCGACAG CGACATCAAG
GACGTCTCTG ACCTTGCAGG CAAGCGCGTC GCGCATACCT CGCCCACTTC GAACTCTGGC
AACCAGGCAC CGCGCGCACT GTTCCCGGGC TTGGGCGTCG AGCCTGACAA AGACTACGAA
GTTGTCTATT CCGGCTCGCA CGACCAATCC ATGCTCGGCG TAGTTGCCGG GGATTACGAC
GCAGCCCCGG TTGCATCCGA GGTTGTGGAA CGCATGGCGG AACGCGGGCT TTATGATCCG
GCAGACGTGC GCATGATCTG GGAAAGCGAT CCGTTCCCAA CCACGTCTTT CACCATGGCG
CATAACCTTG ATCCCGCGCT GGCCGAAAAA GTCAAAGAGG CGTTCTTCAC CTTTGATTTT
GCTGGCACCG CGCTTGGCGA GGAATTTGAT GGCGTGTCGA AATTCGTGCC CATCACCTAC
AAGGACCAGT GGGCCGTGAT CCGTCAGATC CAGGCTTCCA ACGGCGTGGA ATACACGCCC
CAGGGTCTCG CCGGGAACTA A
 
Protein sequence
MTPFITRAIG ALGILTCSTI AAQAQDCAER GALDLQYCDV NGDLVADTPT DPAELSDPDT 
LVFAYTPVED PAVYADIWEP FIEHLADVTG KDVQFFAVQS NSAEVEAMRS GRLHVAGFST
GPTPFAVNLA GAVPFAIMGA EDGQFGYKLQ VFTQADSDIK DVSDLAGKRV AHTSPTSNSG
NQAPRALFPG LGVEPDKDYE VVYSGSHDQS MLGVVAGDYD AAPVASEVVE RMAERGLYDP
ADVRMIWESD PFPTTSFTMA HNLDPALAEK VKEAFFTFDF AGTALGEEFD GVSKFVPITY
KDQWAVIRQI QASNGVEYTP QGLAGN
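
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is one way to do that, not the database's own method. It builds the codon table from the compact NCBI layout (bases ordered TCAG); translation table 11 uses the same amino-acid assignments as the standard code and differs only in which codons may act as start codons. The function names are illustrative, and the sketch assumes the sequence begins with ATG (as this gene does), so alternative start codons are not treated specially.

```python
from itertools import product

# Compact NCBI layout: codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGACACCAT TCATCACCCG TGCGATTGGC"))  # MTPFITRAIG
```

Passing the full gene sequence to `translate` should yield the 326-residue protein, and `gc_percent` on the same input can be compared with the GC content field.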