Gene TM1040_0124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0124 
Symbol 
ID4078729 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp131939 
End bp132949 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content59% 
IMG OID638005411 
Productextracellular solute-binding protein 
Protein accessionYP_612119 
Protein GI99079965 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCA AAGTCTTCAC TACCGTAATC GCCGCGAGCC TTGCTACGAC CGCCGTTGCT 
GACGGGGTTG TGAACCTGTA CTCGTCCCGC CATTACGACA CCGACGAACG CCTCTACACC
GACTTTGAAG AGGCTACCGG TATCACCATC AACCGCATCG AAGGCAAAGC CGATGAGCTG
GTCGCGCGCA TGCAGGCCGA AGGGGCCAAC TCTCCTGCGG ATGTTCTGAT CACCGTCGAC
ACCTCCCGCC TTGAGCGCGC GAAGAACGCC GGTGTGCTTC AGTCCATCGA CAGCGACATT
CTTGAAGAGC GCATCCCCGC CAACCTGCAA GATAGCGACA ACCAGTGGTT TGGTTTCTCT
CAGCGTGCCC GCATCGTCTT CTATGACAAG ACTGACGTGG CCAACCCGCC CGCAGACTAC
ATGGATCTTG CCAAGCCCGA ATACAAAGGC ATGGTCTGCC ACCGATCGTC TTCCAATGTC
TACTCCCAGA CCCTGCTGTC GGCCATCATC GAGAACCACG GTGAAGAGGC GGCACGCGAT
TGGGCAGAAG GCATCGTCGC AAACTTTGCC CGCGATCCGC AGGGTGGCGA TACCGACCAG
CTACGCGGCC TGATCTCCGG CGAGTGCGAC GTGTCGATTG CAAACACCTA TTATTTTGCC
CGTGCCCTGC GCAAAGACGT CAAAGGCCTC TCGGCTGAGA TCGAGAAGAT CGGCGTCGCC
TTCCCGGCTC AGGACGCTGA AGGCGCCCAC ATGAACCTCT CCGGCGCCGG TGTTGCAGCA
CATGCACCGA ACCGTGAGAA CGCCATCAAA TTCCTCGAGT ACCTGGCTTC CGATCAGGCG
CAGGAATATT TCTCCGGCGG CAACGATGAA TTCCCGGCGG TCCCGGGCGT CAGCAAGTCG
GAAAGCGTTG CACAGCTCGG CGAGTTCAAG GCCGACGACG TGGACCTCTC CAAGGTCGCC
AAGAACGTGC CGACCGCACA GAAGATCTTT AACGAGGTTG GCTGGGAATA A
 
Protein sequence
MKIKVFTTVI AASLATTAVA DGVVNLYSSR HYDTDERLYT DFEEATGITI NRIEGKADEL 
VARMQAEGAN SPADVLITVD TSRLERAKNA GVLQSIDSDI LEERIPANLQ DSDNQWFGFS
QRARIVFYDK TDVANPPADY MDLAKPEYKG MVCHRSSSNV YSQTLLSAII ENHGEEAARD
WAEGIVANFA RDPQGGDTDQ LRGLISGECD VSIANTYYFA RALRKDVKGL SAEIEKIGVA
FPAQDAEGAH MNLSGAGVAA HAPNRENAIK FLEYLASDQA QEYFSGGNDE FPAVPGVSKS
ESVAQLGEFK ADDVDLSKVA KNVPTAQKIF NEVGWE