Gene Information |
Locus tag | TM1040_2488 |
Symbol | |
ID | 4076853 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 2628115 |
End bp | 2629692 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 638007812 |
Product | glycosyl transferase family protein |
Protein accession | YP_614482 |
Protein GI | 99082328 |
COG category | |
COG ID | |
TIGRFAM ID | |
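The coordinate and length fields above are mutually consistent, which can be checked with a minimal Python sketch. It assumes the start and end positions are 1-based and inclusive (a common convention for such records, but an assumption here) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length_bp = gene_length(2628115, 2629692)
print(length_bp, protein_length(length_bp))  # 1578 525
```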
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.80194 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.91417 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCAAAAA TCGCCTATAT TCTGCTCTGT CACAAGGATC CGGAGGCCAT CATTCAGCAG GCCAACCGGC TCACTGCGAC CGGCGATTTT ATGGCGATCC ACTTTGATGC CCGTGCCAAG ACCCGCGACT ATCGTGCTAT CCGCTCAGCG CTTTCAGACA ATCCCAACGT GACGTTCGCC AAGCGTCGGG TGAAATGTGG TTGGGGCGAA TGGTCCTTGG TGCGCGCCAC GCTCAATGCG CTGGAGGCAG CTGTCGACGA GTTCCCGCGC GCAACGCATT TTTATATGCT GTCGGGGGAT TGCATGGCGA TCAAGACGGC GCAATACGCG CGCGCCTTTC TCGATCAGCA CGACAAGGAT TTCATCGAGA GCTTTGATTT CTTCGAGAGC GACTGGATCA AGACCGGGAT GAAGGAGGAC CGGCTGATCT ATCGCCACTA CTTTAACGAA CGCACCCAGA AACGCCTGTT CTATGCCGCG TTCGAGCTTC AGAAGAAGCT CAAACTGACG CGGGAGGTGC CCGCCGATAT TCAGGTCCAG ATCGGCAGCC AATGGTGGTG CCTGCGCCGA CGCACCGTCG AAGCTGTGCT TGCGATGACA CGCAAACGCC GCGACGTGAT GCGCTTTTTT GCCTCGACCT GGATTCCGGA TGAGACGTTT TTTCAGACGC TTGTGCGCCA CCTCATTCCT GAAGATGAGA TCGAAAGTCG CACACTTACG TTTTTAATGT TCAGCGACTA CGGCATGCCG GTGAATTTTT ATAACGATCA CTATGATCTG TTGCTGGGGC AGGATTTCCT GTTTGCGCGC AAAATCAGTC CTGATGCAAA AGAACTGAAA ACGCGCCTCG GGCGTCTGTA TGCCGCGCGC GATGTGGAGT TCAAAATTTC CAACGAGGGG CGCAACCTTT ACAAGTTTCT GTCCGAGCGC GGGCGCACCG GACAGCGTTT TGCACCGCGC TTCTGGGAAA CCGAGAGCAG CCTCGGGCGC GAGCGTGAAT TGTTGATCCT CACCTGCAAG AAATGGCATG TGGCCAAACG TATGCTGGAG CAGATCCGCA CGCTCACCAA TACGCCCGCG ATCGAGTATC TTTTCCACGA AGAAGGCACG CCCCTGCCCG ATCTTGGCGG CATCCAGCGC ACCCTCGCCA AACGCACCCG TCATCGGCGC GCCCTGGTGC GGATGCTGTT TGACTATTAC GAGACCGACC GGCTGATTAT CTGCCTTGAT CCGTCCGCGC TCGAACTGAT GCATGATTTC TATTCAGACC GGTCCCACAC GCGGCTCCTG CGGATCGACT GCGATTTTTC GGACAGCTAC CTCATTGGCC ACGCGCATCG GGTCGGGCTC GCCGGTGAAC ATGCGGCCAA GGCAACGCTT GAGCGGCTGC TGCCCGCAAT CCGCAACGAC ATCAGCAATG AAATTGACCA AATCCGCGAT GCGGGTTTTG TGCGCCATTG GACGGTCGCG GAACGCGGCC CCGAGAGCGA CAACGCTCTT GCGGTTTCAC AGTTTCTGGA CGTGCCGGTT GAGACCGCCC TTGAGGTGGT GCGCACGCCC TATCTGTTCG CCGACTAG
Protein sequence | MAKIAYILLC HKDPEAIIQQ ANRLTATGDF MAIHFDARAK TRDYRAIRSA LSDNPNVTFA KRRVKCGWGE WSLVRATLNA LEAAVDEFPR ATHFYMLSGD CMAIKTAQYA RAFLDQHDKD FIESFDFFES DWIKTGMKED RLIYRHYFNE RTQKRLFYAA FELQKKLKLT REVPADIQVQ IGSQWWCLRR RTVEAVLAMT RKRRDVMRFF ASTWIPDETF FQTLVRHLIP EDEIESRTLT FLMFSDYGMP VNFYNDHYDL LLGQDFLFAR KISPDAKELK TRLGRLYAAR DVEFKISNEG RNLYKFLSER GRTGQRFAPR FWETESSLGR ERELLILTCK KWHVAKRMLE QIRTLTNTPA IEYLFHEEGT PLPDLGGIQR TLAKRTRHRR ALVRMLFDYY ETDRLIICLD PSALELMHDF YSDRSHTRLL RIDCDFSDSY LIGHAHRVGL AGEHAAKATL ERLLPAIRND ISNEIDQIRD AGFVRHWTVA ERGPESDNAL AVSQFLDVPV ETALEVVRTP YLFAD
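The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the standard codon-to-amino-acid mapping (translation table 11 uses the same assignments as the standard code and differs only in which codons may act as starts) and translates the first 30 and the last 9 bases of the gene sequence shown above. The `translate` helper is hypothetical, written only for illustration.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon; whitespace is ignored, '*' marks a stop."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base groups of the gene sequence in this record.
print(translate("ATGGCAAAAA TCGCCTATAT TCTGCTCTGT"))  # MAKIAYILLC
# Last nine bases of the gene sequence, ending in the stop codon TAG.
print(translate("GCCGACTAG"))  # AD*
```

The two outputs match the first ten residues and the final two residues of the protein sequence, with the trailing `*` corresponding to the stop codon that is not listed in the protein.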