Gene Information |
Locus tag | TM1040_0019 |
Symbol | |
ID | 4078682 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 20200 |
End bp | 21900 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 638005306 |
Product | glycosyl transferase family protein |
Protein accession | YP_612014 |
Protein GI | 99079860 |
COG category | |
COG ID | |
TIGRFAM ID | |
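
The coordinate and length fields in this record agree with one another, and a short sketch can confirm it. The sketch assumes 1-based, inclusive coordinates (an assumption, although it is the convention that reproduces the listed 1701 bp) and an unspliced CDS ending in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon encodes one residue.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 20200, End bp 21900.
length = gene_length(20200, 21900)
print(length, protein_length(length))  # prints: 1701 566
```

Both results match the Gene Length (1701 bp) and Protein Length (566 aa) rows.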
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00457408 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAACG TTGGCGTGAT CATGTTGGTG CATACGGCCC TCGAGCGGGC CGAGCAGGTC GTGCGCCACT GGACCGCCGC GGGCTGCCCA GTAGTGCTGC ATGTAGACCG GCAAGTGAGC AAACAGGTGT TTCAGGAGTT CTGCGCGCGC CTGAAAGACG ACCCCCTGGT ACGCTTTTCG CGCCGCCACA GATGTGAATG GGGGACCTGG GGCATTGTGG CCGCCTCTCA AAGCGCGTCT GAATTGATGC TCTCCGAGTT CCCGGATGTG AGCCACATCT ATCTGACCTC CGGGTCCTGC CTGCCGTTGC GCCCGGTGAA AGAGCTGCAA ACATACCTCG CGGCTCGGCC CGACATTGAT TTCATCGAAA GCGCCACAAC CTCGGATGTC CCTTGGATCG TTGGGGGTCT CAGCACCGAG CGCTTTACGA TGCGGTTTCC CTTTTCCTGG AAACGGCACC GCAAGCTGTT TGATCGCTAT GTCCGGCTGC AACGGAAACT GAAGTTGCGA CGTCGGATCC CGGCCGGATT GGTCCCCCAC ATGGGGAGCC AATGGTGGTG CCTTACTCGA CGCACTTTGA CGGCCATTCT CACGGATGCC GACAGAGCGC GGTATGACGC GTATTTTCGT CACGTCTGGA TCCCGGACGA GAGCTACTAT CAAACGCTTG CCCGGCTCCA TTCCGACAAT ATCGAGAGCC GGTCGCTGAC CCTTTCGAAG TTCGACTATC AGGGCAAACC GCATGTGTTC TATGACGATC ACCTGCAACT CTTGCGGCGT TCCGACTGTT TTGTCGCGCG CAAGATCTGG CCGCATGCAG AGCGGCTCTA TCAGGCCTTC CTGTGCCCAT CGGCCACTGA CATCAACCGC ACAGAGCCCA ATCCGGGCAA GATCGACAGG ATTTTTGCCA AAGCGGTGGA GCGGCGCATC CGAGGGCGCA AGGGGCTCTA CATGCAGAGC CGCTTTCCCG CCAAAGGAGT CGAAAGCCAG CAGACCTGTG GGCCGTATTC GGTGTTTCAA GGTTTTTCCG AATTGTTTGA GGATTTTGAA AGCTGGCTGG CGCGCGCAAC CGGTGCGCGC GTGCATGGCC ATCTTTTTGC GCCTGAACGG GCCGAATTCG CGGGGGAGCA GAGGCTGTTC AACGGCTGTC TGAGCGACAG TGCGCTCCTG CGCGATCGAA ACCCCCAGAG CTTTCTGTCG AACCTGATCT GGAACACACG CGGCGAGCGG CAGTGTTTCC AATTCGGACC CCATGACACC CAGAGCATCA ACTGGTTTAT CGCGCAGGAC AGCAATGCGC AGATCTCGGT GATCTCTGGG GCATGGTCTG TGTCCCTGTT TAAGACTAAC CGAAGCTTTA GCGATCTTCG GCGCGAGGCG GCAAAGCTGC AGCGCATTGA GGCTGAGCAT CTCAAGATCT TGCGAGGGCG CTGGACCCAT GCCCGGATCC GGGTCTGGAG CATGGCGGAG TTTATCGAGG CGCCGATGGA AAACCTGCAA ACCATCGTGG ATGAGATCGG TCCCATGTCC CGTAGCCATA TACTAGAAGC GCCGCAGATG GTGGATCTCA CAGGGTTTGG CCAGTTCCTG CAAAATCTGA AAAATCAGGG GATGCACCCC TATTTGATGG GGGATTTCCC CGCGCGCTCT GCGCCAAAGC AACCCATGCG CCAGGCGCGC AAAACCTATA TCGTGAAGTA A
|
Protein sequence | MSNVGVIMLV HTALERAEQV VRHWTAAGCP VVLHVDRQVS KQVFQEFCAR LKDDPLVRFS RRHRCEWGTW GIVAASQSAS ELMLSEFPDV SHIYLTSGSC LPLRPVKELQ TYLAARPDID FIESATTSDV PWIVGGLSTE RFTMRFPFSW KRHRKLFDRY VRLQRKLKLR RRIPAGLVPH MGSQWWCLTR RTLTAILTDA DRARYDAYFR HVWIPDESYY QTLARLHSDN IESRSLTLSK FDYQGKPHVF YDDHLQLLRR SDCFVARKIW PHAERLYQAF LCPSATDINR TEPNPGKIDR IFAKAVERRI RGRKGLYMQS RFPAKGVESQ QTCGPYSVFQ GFSELFEDFE SWLARATGAR VHGHLFAPER AEFAGEQRLF NGCLSDSALL RDRNPQSFLS NLIWNTRGER QCFQFGPHDT QSINWFIAQD SNAQISVISG AWSVSLFKTN RSFSDLRREA AKLQRIEAEH LKILRGRWTH ARIRVWSMAE FIEAPMENLQ TIVDEIGPMS RSHILEAPQM VDLTGFGQFL QNLKNQGMHP YLMGDFPARS APKQPMRQAR KTYIVK
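
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a field could be cleaned, checked for GC fraction, and translated. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the start codons it permits. The sketch does not handle alternative start codons, and all names in it are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base T, C, A, G, then second, then third).
# Table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGAGCAACG TTGGCGTGAT CATGTTGGTG"
print(translate(first_blocks))  # prints: MSNVGVIMLV
```

The translated prefix matches the first block of the protein sequence listed in the record. Passing the full gene sequence field to `gc_fraction` gives a value that can be compared with the GC content row.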
|
| |