Gene TM1040_0019 details

Gene Information

Locus tag: TM1040_0019
Symbol:
ID: 4078682
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 20200
End bp: 21900
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 58%
IMG OID: 638005306
Product: glycosyl transferase family protein
Protein accession: YP_612014
Protein GI: 99079860
COG category:
COG ID:
TIGRFAM ID:
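
The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon; neither is stated in the record. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(20200, 21900)
print(length, protein_length(length))  # 1701 566
```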


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00457408
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAACG TTGGCGTGAT CATGTTGGTG CATACGGCCC TCGAGCGGGC CGAGCAGGTC 
GTGCGCCACT GGACCGCCGC GGGCTGCCCA GTAGTGCTGC ATGTAGACCG GCAAGTGAGC
AAACAGGTGT TTCAGGAGTT CTGCGCGCGC CTGAAAGACG ACCCCCTGGT ACGCTTTTCG
CGCCGCCACA GATGTGAATG GGGGACCTGG GGCATTGTGG CCGCCTCTCA AAGCGCGTCT
GAATTGATGC TCTCCGAGTT CCCGGATGTG AGCCACATCT ATCTGACCTC CGGGTCCTGC
CTGCCGTTGC GCCCGGTGAA AGAGCTGCAA ACATACCTCG CGGCTCGGCC CGACATTGAT
TTCATCGAAA GCGCCACAAC CTCGGATGTC CCTTGGATCG TTGGGGGTCT CAGCACCGAG
CGCTTTACGA TGCGGTTTCC CTTTTCCTGG AAACGGCACC GCAAGCTGTT TGATCGCTAT
GTCCGGCTGC AACGGAAACT GAAGTTGCGA CGTCGGATCC CGGCCGGATT GGTCCCCCAC
ATGGGGAGCC AATGGTGGTG CCTTACTCGA CGCACTTTGA CGGCCATTCT CACGGATGCC
GACAGAGCGC GGTATGACGC GTATTTTCGT CACGTCTGGA TCCCGGACGA GAGCTACTAT
CAAACGCTTG CCCGGCTCCA TTCCGACAAT ATCGAGAGCC GGTCGCTGAC CCTTTCGAAG
TTCGACTATC AGGGCAAACC GCATGTGTTC TATGACGATC ACCTGCAACT CTTGCGGCGT
TCCGACTGTT TTGTCGCGCG CAAGATCTGG CCGCATGCAG AGCGGCTCTA TCAGGCCTTC
CTGTGCCCAT CGGCCACTGA CATCAACCGC ACAGAGCCCA ATCCGGGCAA GATCGACAGG
ATTTTTGCCA AAGCGGTGGA GCGGCGCATC CGAGGGCGCA AGGGGCTCTA CATGCAGAGC
CGCTTTCCCG CCAAAGGAGT CGAAAGCCAG CAGACCTGTG GGCCGTATTC GGTGTTTCAA
GGTTTTTCCG AATTGTTTGA GGATTTTGAA AGCTGGCTGG CGCGCGCAAC CGGTGCGCGC
GTGCATGGCC ATCTTTTTGC GCCTGAACGG GCCGAATTCG CGGGGGAGCA GAGGCTGTTC
AACGGCTGTC TGAGCGACAG TGCGCTCCTG CGCGATCGAA ACCCCCAGAG CTTTCTGTCG
AACCTGATCT GGAACACACG CGGCGAGCGG CAGTGTTTCC AATTCGGACC CCATGACACC
CAGAGCATCA ACTGGTTTAT CGCGCAGGAC AGCAATGCGC AGATCTCGGT GATCTCTGGG
GCATGGTCTG TGTCCCTGTT TAAGACTAAC CGAAGCTTTA GCGATCTTCG GCGCGAGGCG
GCAAAGCTGC AGCGCATTGA GGCTGAGCAT CTCAAGATCT TGCGAGGGCG CTGGACCCAT
GCCCGGATCC GGGTCTGGAG CATGGCGGAG TTTATCGAGG CGCCGATGGA AAACCTGCAA
ACCATCGTGG ATGAGATCGG TCCCATGTCC CGTAGCCATA TACTAGAAGC GCCGCAGATG
GTGGATCTCA CAGGGTTTGG CCAGTTCCTG CAAAATCTGA AAAATCAGGG GATGCACCCC
TATTTGATGG GGGATTTCCC CGCGCGCTCT GCGCCAAAGC AACCCATGCG CCAGGCGCGC
AAAACCTATA TCGTGAAGTA A
 
Protein sequence
MSNVGVIMLV HTALERAEQV VRHWTAAGCP VVLHVDRQVS KQVFQEFCAR LKDDPLVRFS 
RRHRCEWGTW GIVAASQSAS ELMLSEFPDV SHIYLTSGSC LPLRPVKELQ TYLAARPDID
FIESATTSDV PWIVGGLSTE RFTMRFPFSW KRHRKLFDRY VRLQRKLKLR RRIPAGLVPH
MGSQWWCLTR RTLTAILTDA DRARYDAYFR HVWIPDESYY QTLARLHSDN IESRSLTLSK
FDYQGKPHVF YDDHLQLLRR SDCFVARKIW PHAERLYQAF LCPSATDINR TEPNPGKIDR
IFAKAVERRI RGRKGLYMQS RFPAKGVESQ QTCGPYSVFQ GFSELFEDFE SWLARATGAR
VHGHLFAPER AEFAGEQRLF NGCLSDSALL RDRNPQSFLS NLIWNTRGER QCFQFGPHDT
QSINWFIAQD SNAQISVISG AWSVSLFKTN RSFSDLRREA AKLQRIEAEH LKILRGRWTH
ARIRVWSMAE FIEAPMENLQ TIVDEIGPMS RSHILEAPQM VDLTGFGQFL QNLKNQGMHP
YLMGDFPARS APKQPMRQAR KTYIVK
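
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code; table 11's alternative start codons are not handled, which does not matter here because the gene starts with ATG. The helper name `translate` is illustrative.

```python
from itertools import product

# Codon assignments in T, C, A, G order (shared by the standard code and table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence shown above.
print(translate("ATGAGCAACG TTGGCGTGAT CATGTTGGTG"))  # MSNVGVIMLV
print(translate("AAAACCTATA TCGTGAAGTA A"))           # KTYIVK
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence, after removing the spaces between the 10-character blocks, extends the same check to the whole record.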