Gene TM1040_0855 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0855 
Symbol 
ID4076030 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp907552 
End bp909198 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content62% 
IMG OID638006153 
ProductNa+/Pi-cotransporter 
Protein accessionYP_612850 
Protein GI99080696 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1283] Na+/phosphate symporter 
TIGRFAM ID[TIGR00704] Na/Pi-cotransporter 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACGGCAT TTTTGATTCA GATCGCGGGC GCTGCGGCGC TATTGTTGTG GGCGGTGCGT 
CTGGTGCGCA CCGGCGTGGA GCGAGCTTTT GCCACACAGC TGAGACAGGG GCTAAGGCGC
GCCTCTGACA ATCGCCTGCT GGCCGCTTTT TCGGGGCTGG TGTCGGCGAT GATGCTGCAG
AGCTCCACCG CCGTGGCCAT GCTGGTATCG AACTTTGCCA GCAAGGGCGC GCTCAGTATC
GCGGTTGGGA TTGCCATGTT GCTCGGGGCG GATCTTGGCT CTGCCCTTGT GGCGCAGATC
CTTCTGGCCC GTCCGGGCAT TCTGGTGCCT GCGCTCCTTC TGGTGGGGGT GGCGCTGTTC
CTGCGCAGCG AACAGCGTAA ACTGCGCCAG ACCGGGCGTA TCCTGATCGG GCTTGCGCTG
ATTTTTGTCT CGCTCGACAT GCTGCGCACC GCCACAGCAC CAATCATCGA GAGCAAGGGC
GCTGTCGCGG TACTGGGGTA TCTCGATTCC GACCTTTTCA CGGCCTTTAT TCTCGGGGCG
ATCTTTGCCT GGGTGGTGCA CTCCAGCGTT GCGGCGATCT TGTTGTTTGT CACCATGGTC
GCCCAGGGCG CGCTGCCGCC GATGGCGGCG ATGGCGATGG TGCTTGGCGC CAACCTTGGC
GGCGCCATGA TCGCCTTTGT GCTGACGCTT GCTGCCGATG TGCGCGCGCG GCGTATCGTG
AGCGCCAATC TCGTCTTGCG AGGCGGCGGA GCGCTTGCGG TGCTCTTGGC GCTGAACCTG
AACCTTCTGG ACCCCACGTG GCTCGGGCAG GCGGCTGCCG CTCAAGTGAT GGGGCTGCAT
CTGGTCTTCA ATCTTGTGCT CTGTCTGCTG GCGCTGCCCT TTGTGGGGCT CATCGCCCGC
ATGAGCGAGC GGGTGATGCG CGACCCCAAT GTGCAATCTC TGGGGCGCGG GATGACGGCC
TTGGACCCCG CCGCGCTCGA GGACCCGCCG CGTGCGCTGA ACTGTGCCGC GCGCGAGCTG
TTGCGCATGG GTGAAGTCAT CGAGAAAATG CTGGTTGCGG CCATGGGGCT CTATGAAGTC
TGGGATGACC GCGTCGCCGA GGCGATCCGT GACGAGGATA AGGCACTGCG GCAGATGCAT
CTCGACCTCA AGCTCTATCT GGCGGGGATG AACCGTCACA AGCTGGATGA CAGCGCCGTC
AATGGCGCGA TCGAGGCATC GCTCATGGCC GCCAACCTGG AGGCGGCCGC CGATCAGGTG
GCCCGCAAAA TGACCGATCT GGCGCGCAAG CTCTCGCTTG AAGGGATCAG CTTTTCCAAG
AGTGGCCAGC AGGAAATCAA CGATTTCATG GACCGCATTC AAGCCAATGT TCAACTCGCG
CTAACGGTCA TGATGACCCG CAACCCCGAT GATGCGCGCG AGCTGGTGGC CGAAAAGGAA
AGCATCCGAA CCGCCGAACA AAAACTGCAG AGCAAACACC TGTCGCGCCT TCGCGAAGGC
GTCAGCGAAA GCATCGAGAC CAGCGCAATC CATCAGGAAA CCTTGCGCAT TCTCAAACAG
GTCAATACCT CTTTTGCGAT GGCAGGGTAT CCGATCCTCG ACGAATCCGG TGATCTGCTG
TCCAGTCGCC TGTCCTCGCA GGGCTGA
 
Protein sequence
MTAFLIQIAG AAALLLWAVR LVRTGVERAF ATQLRQGLRR ASDNRLLAAF SGLVSAMMLQ 
SSTAVAMLVS NFASKGALSI AVGIAMLLGA DLGSALVAQI LLARPGILVP ALLLVGVALF
LRSEQRKLRQ TGRILIGLAL IFVSLDMLRT ATAPIIESKG AVAVLGYLDS DLFTAFILGA
IFAWVVHSSV AAILLFVTMV AQGALPPMAA MAMVLGANLG GAMIAFVLTL AADVRARRIV
SANLVLRGGG ALAVLLALNL NLLDPTWLGQ AAAAQVMGLH LVFNLVLCLL ALPFVGLIAR
MSERVMRDPN VQSLGRGMTA LDPAALEDPP RALNCAAREL LRMGEVIEKM LVAAMGLYEV
WDDRVAEAIR DEDKALRQMH LDLKLYLAGM NRHKLDDSAV NGAIEASLMA ANLEAAADQV
ARKMTDLARK LSLEGISFSK SGQQEINDFM DRIQANVQLA LTVMMTRNPD DARELVAEKE
SIRTAEQKLQ SKHLSRLREG VSESIETSAI HQETLRILKQ VNTSFAMAGY PILDESGDLL
SSRLSSQG