Gene Information |
Locus tag | TM1040_3688 |
Symbol | |
ID | 4075657 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 748946 |
End bp | 750166 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638005208 |
Product | hypothetical protein |
Protein accession | YP_611917 |
Protein GI | 99078659 |
COG category | [R] General function prediction only |
COG ID | [COG4671] Predicted glycosyl transferase |
TIGRFAM ID | |
|
|
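The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends with a single stop codon; neither convention is stated in the record. The helper names `span_length` and `coding_codons` are hypothetical and only illustrate the check.

```python
# Values copied from the Gene Information fields above.
start_bp = 748946
end_bp = 750166
gene_length_bp = 1221
protein_length_aa = 406


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both comparisons hold for this record under the assumptions above.
print(span_length(start_bp, end_bp) == gene_length_bp)      # True
print(coding_codons(gene_length_bp) == protein_length_aa)   # True
```

Under these assumptions, 750166 - 748946 + 1 = 1221 bp, and 1221 / 3 = 407 codons, of which one is the stop codon, leaving 406 aa.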
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.112899 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGCCA GCCCCCTTCC CCGTTTCGGC CCTTCGGACC GTGGCCCCCG CATCCTGTTT TACAGTCACG ACACGTTTGG CCTTGGTCAC TTGCGGCGCT CGCGCGCCCT GGCAGCGGCG ATCACTTCGG CGGACCCCAA AGCCTCTGCA ATGATCCTGA CCGGCTCGCC AGTGGCGGGG CGATTTGCCT TTCCCAATCG CGTGGACCAC ATGCGCCTGC CCGGTGTGAT CAAGCGCGCT GACGGCTCAT ATGCCAGCCG CACAATGGGT ATGAGCATCG AAGAGACCAC GGAGCTGCGC GCGGGCCTCA TCCGCTCCAC CGCCGAGCAG TTTGCGCCCG ATATACTGGT GGTCGACAAG GAACCCACCG GATTTCGCGG CGAGTTGATC CCCACGCTTG ATCTCTTGCA GGAACGCGGC CAGGCGCGCC TTGTGCTGGG TTTGCGCGAT GTTCTGGACG AACCAGAAGT GCTGCGCGCC GAATGGGAGC GCAAATCCGC GCTCGAGGCG GCAGAGACCT ACTATGATGA AATCTGGATC TACGGTTTGC ATGACGTCTA TGATCCGACC GCAGGCCTGC CCCTGAGCAA AGAGACACAG GCGCGCATGC ACTGGACGGG CTATCTGCGC CGCGATCTCG GCGAAGTGGG CGAGCCTCCA GAGCAGCCTT ATGTTCTGAT TACCGCCGGC GGTGGCGGCG ATGGCGCGAT GATGGTGGAT CTTGCGATTT CCGCCTATGA ACGCGATCCC ACTCTCACGC CACGCGCGAT GCTGGTCTAC GGCCCGTTCC TGTCCGGTGA CACCCGCGCC GCATTTGAGG ATCGGGTCGC CGCCCTTGAC GGGCGGGTCA GCGCCGTCGG CTTTGAGAGC CAGATCGAGA CGCTGTTTGC CGGAGCGCAG GGCGTCATCT GCATGGGCGG TTACAACACG TTCTGCGAGG TGCTTTCGTT TGACAAACCG GCCGTGATTG TGCCGCGTAC CACGCCCCGG CTGGAGCAGT GGATCCGGGC CAGCCGTGCC GAGGAACTGG GCCTCGTGAC CATGCTCGAC GAAACCCGCG ATGGCTGGAC GCCCGAGGCG ATGATCGGTG CGATCCGCGC GCTGGAGCGC CAGCCTAACC CCTCAAAAGC GATCTCTGAC GGGCTACTTG ACGGACTCGA CTATGTGACC GAACGGGTCA ATGCACTGTT GCAACAACTC CCGCGTGAGG TCAGCGCATG A
|
Protein sequence | MSASPLPRFG PSDRGPRILF YSHDTFGLGH LRRSRALAAA ITSADPKASA MILTGSPVAG RFAFPNRVDH MRLPGVIKRA DGSYASRTMG MSIEETTELR AGLIRSTAEQ FAPDILVVDK EPTGFRGELI PTLDLLQERG QARLVLGLRD VLDEPEVLRA EWERKSALEA AETYYDEIWI YGLHDVYDPT AGLPLSKETQ ARMHWTGYLR RDLGEVGEPP EQPYVLITAG GGGDGAMMVD LAISAYERDP TLTPRAMLVY GPFLSGDTRA AFEDRVAALD GRVSAVGFES QIETLFAGAQ GVICMGGYNT FCEVLSFDKP AVIVPRTTPR LEQWIRASRA EELGLVTMLD ETRDGWTPEA MIGAIRALER QPNPSKAISD GLLDGLDYVT ERVNALLQQL PREVSA
|
| |
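The Translation table and GC content fields relate the gene sequence to the protein sequence. As a minimal sketch, the code below translates a nucleotide string with the codon assignments of translation table 11 (which shares its amino-acid assignments with the standard code and differs only in permitted start codons) and computes a GC fraction. The function names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first 30 nt of the gene sequence are used as input here.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from the record.
prefix = "ATGAGTGCCA GCCCCCTTCC CCGTTTCGGC"
print(translate(prefix))  # MSASPLPRFG, the first block of the protein sequence
```

Passing the full Gene sequence value to `translate` and `gc_fraction` would allow a comparison against the Protein sequence and GC content fields; the results of that comparison are not asserted here.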