Gene TM1040_1467 details


Gene Information

Locus tag: TM1040_1467
Symbol:
ID: 4077764
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1567246
End bp: 1569153
Gene Length: 1908 bp
Protein Length: 635 aa
Translation table: 11
GC content: 64%
IMG OID: 638006778
Product: hypothetical protein
Protein accession: YP_613462
Protein GI: 99081308
COG category: [S] Function unknown
COG ID: [COG2898] Uncharacterized conserved protein
TIGRFAM ID:
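
The coordinate and length fields above are consistent with each other, and the short sketch below shows the arithmetic. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length. The helper names `gene_length` and `protein_length` are hypothetical and are not part of the record.

```python
# Coordinates copied from the record; assumed 1-based and inclusive.
START_BP = 1567246
END_BP = 1569153


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start/end coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

With the values in this record the sketch gives 1908 bp and 635 aa, matching the Gene Length and Protein Length fields.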


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGATGC GGGCCAGACG CAAGCAGACC TTTGCGAATT CCCTGAGGGT GGTGACGCCC 
CTGCTCATTA TGGCGGGCTG TTTGTTCGCC CTGACCCGAC AAGCGGATCT GCCGCATTTC
CACGACCTTC TGGGCCTGCT CACGCAGGTG CCCGCGCCGC ATTGGATCGG CGCCCTCGGG
GCAACTGTGC TCAGTTTCTG GGCTCTTGGC CGCTATGACG CGGTCGCACA CCGGCATTTG
CGCAGCGGAA TCGATGACCG GACCGCACGG CGCGCGGGCA TGGCCGCTAT CGCCTTTTCG
CAGGCCGTTG GATTTGGCTT GTTTTCGGGG TCCTTTGCAC GTTGGCGCCT GCTGCCGCAA
CTGAATCCAT TGCTTTCGGC GCAGCTGACC GGGTTTGTGG GCATCACCTT CATGACAGCC
CTCGCTGTCA TTTGCGGGAT CTTTCTGATC CTTGTGGGAC CTTCATGGGG GATGCGCCTT
GTGGGCGGGG GCATCCTGTT TGCCGCGATC GCCGCTGTTG GCCTGTGCTT CTTGCACCCA
GAGTGGCGCA TCCGTGGTTT GCGGCTCCGG TTCCCTTCTG TGCAGGCCAT CCTTGCGCTG
GCGCTTTGGA CCATGATGGA TGTGACCTTT GCCGGGGTCG CCCTCTGGCT GTTGCTGCCG
GTTGGACATG GCATCGGCCT TGATGTCTTG CTGACCGCCT ATTTTCTGGC CCTTGGACTG
GCGATCATCT CCTCCTCTCC GGGCGGAGCC GGGCCGCTGG AACTTGCAAT GCTCACGCTT
CTGCCCGGTG CCGATCCCGC GACGCTGGTG GCAGGACTTC TCGCCTTTCG GGCAGTTTAT
TATGCGCTGC CCGCGATGCT TGCGGGTGCT GTGTTGCTCT GGCCACGGCT GCTGCGCCAC
GGAAAGGCAA TGCCGGACCC CTGGGAGACT GGCGATCTGG GCTGCGATCT GCGGCCTGCT
GCCAGCCAGC CCTTCGTCCG CCCGCAGGCC GAAACGGCCG TGCTCTTGCA AAACGGCGGC
CATGTGATGG CCTTTGGGCT CAATCAAGTG GCGCTCCTTG ATAGCCCGCA GCTCTCTGCG
GTGCTATTTG ATCCAATCAG CGGCCGACAA GACGAGATCC CCGCCGCCCT GCGCGCCCAT
GCACTCTCGC GCAATGCGGC AGCTTGCTTT TACAAATGCA GTGCCCGCAC CGCGCTGGCA
GCCCGTCAGG AGGGTTGGAA GATCCTGAGA GTGGCTCAGG ACGCCATCCT TGCGCCGGAG
ACCTTCACCG TCGAGGGCTC CAAGCATCGC CAACTGCGTC GCAAACTGCG CCACGCCGAG
AAGGCCGGAC TGCAGGTGGA GCCCGCCTGG GGGACGCTGC CCTTGACCCA GATGGCCCAG
GTTGATGCCG CATGGACGCG CCAGCACGGC GGTGCCCGTG GCACGACGAT GGGCCAGTTC
GAGCCGGGAT ATGTGGCGAT CCAGATGACC TGTCTTGCCT GGCTTGAGGA CCGCCTTGTC
GGCTTCATGA CCTTTCACCG GGCGGCGGAT GAATGGTGCC TTGATCTGGT GCGCCAATTG
CCCGGGGCGC CCGATGGCAC CGCCCATGCT ATGATTTGCA CCGCGGTCGA GGCCGCGCGC
GATGCGGGCG TGCGCCGCCT GTCGCTTGCG TCGGTTCCCG ATCACCGCTT CAGCGCGCGC
TTTGATGGCG GCCTGCGTCA GTTCAAGTCC TGCTTTGCCC CCACCTGGGA GGCCCGATAC
ATGGCGGCGC CAAGCTGGGC TCAGATGGGG CTCGCGATTG CCGAAATGAC CCGGCTGGTG
CATCGTCCGG CGCGTCCGGA GGCTGCAATG GCGCAGATCG ACATGCTGCA GGACCCTATT
CCTGATGATA CTGTCGAAAA TGCAGTTGCG GCAAAACGGA CCGCGTGA
 
Protein sequence
MPMRARRKQT FANSLRVVTP LLIMAGCLFA LTRQADLPHF HDLLGLLTQV PAPHWIGALG 
ATVLSFWALG RYDAVAHRHL RSGIDDRTAR RAGMAAIAFS QAVGFGLFSG SFARWRLLPQ
LNPLLSAQLT GFVGITFMTA LAVICGIFLI LVGPSWGMRL VGGGILFAAI AAVGLCFLHP
EWRIRGLRLR FPSVQAILAL ALWTMMDVTF AGVALWLLLP VGHGIGLDVL LTAYFLALGL
AIISSSPGGA GPLELAMLTL LPGADPATLV AGLLAFRAVY YALPAMLAGA VLLWPRLLRH
GKAMPDPWET GDLGCDLRPA ASQPFVRPQA ETAVLLQNGG HVMAFGLNQV ALLDSPQLSA
VLFDPISGRQ DEIPAALRAH ALSRNAAACF YKCSARTALA ARQEGWKILR VAQDAILAPE
TFTVEGSKHR QLRRKLRHAE KAGLQVEPAW GTLPLTQMAQ VDAAWTRQHG GARGTTMGQF
EPGYVAIQMT CLAWLEDRLV GFMTFHRAAD EWCLDLVRQL PGAPDGTAHA MICTAVEAAR
DAGVRRLSLA SVPDHRFSAR FDGGLRQFKS CFAPTWEARY MAAPSWAQMG LAIAEMTRLV
HRPARPEAAM AQIDMLQDPI PDDTVENAVA AKRTA
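
The sequences above are displayed in blocks of 10 characters separated by spaces, so they need to be joined before any analysis. The sketch below is one possible way to do this and to compute a GC fraction like the one reported in the record. The function names `clean_sequence` and `gc_fraction` are hypothetical, and only the first displayed line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed line of the gene sequence, copied as shown in the record.
first_line = (
    "ATGCCGATGC GGGCCAGACG CAAGCAGACC "
    "TTTGCGAATT CCCTGAGGGT GGTGACGCCC "
)
seq = clean_sequence(first_line)
print(len(seq), "bases; GC fraction of this line:", round(gc_fraction(seq), 2))
```

Applying `clean_sequence` to the full gene sequence and then `gc_fraction` to the result should give a value comparable to the GC content field, assuming that field is computed over the whole gene.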