Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_2409 |
Symbol | |
ID | 4076735 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 2548100 |
End bp | 2549374 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638007731 |
Product | lipopolysaccharide biosynthesis |
Protein accession | YP_614403 |
Protein GI | 99082249 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.352526 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGTCTA TCTATTCCAT TGCAGATTTC ATCGACATGC TGCGCCGCCG TGTGTCGCTG ATCGTTGTCG TCACATTTCT GGGCTGTGTG GTGTCGGTCT GGCTGGCGCT GCAGAAGCAG CCGATCTATT CCAGCACCGA AGTGATCCAG ATTACCCGTC CAAAGATTGC CGGAGATCTG GCGCGCTCGA CTGCGGAGGG GTCCTCTGCA CGACGTATCC AATTGATCGA ACAACAGCTG ATGGCGCGCG GAACGATCCT AGAAATCGTG GATCAGCTTG ACCTCTTTGC GGACCGTCCC GGGCTTCTGG ATTCCGAAAT TGTCCCCCTG ATGCGCAACT CTGTTTCTTT GATGGGGACC GCCGCGGCCC GCGAAGGCGG GTCCGACGAT GGCGCGATCT CGGTGCTTAC GATCACCGCG AATATGCCGA CCCGCGAGCA AGCGCAGGCC GTCGCGCGCG AATTTTCCAA GCGCACCATT GCCCTCAGCC AGAACACCCG GCTCGCCGAG GCGCGCGAAA CCTTGCTCTT TTTCAATGAA AAAGAAGCGG CATTGGTGCG TGACATCACG GCGCTTGAGG AAGAGATTGC CAACTTCAGG CACGAGAACG CCGTGACCTT GCCCGGTGCA ATCGAGATGC GAGGCGCGGA AATTACGGCG ATCAATGAAA GCCTGCTGGA ACTCGCGCGA CAAGAGATCG AATTGCGCAA AGGTGCTGAG GAGGCCGAGG CAACACAACG TGAAGCCTAT GCGCGCCGGG TTCGCGAAGA GTTCGACGCC CAGCTCGAAA GCTTGACAGC GCAGCGCCAG CTGCTTGTGG ATCGCCGCGC CGAACTGGAG GCGTCTCTCG AGCTCACCCC GGACGTGGAT CGCCAATTGG CCAGCTATGA GCGGCGTCAA CAGCAGATGC AGTCCGAGCT TGAAGTCATC ACCGCGCGTC GCGCCGAGGC CGAGGTCGGG TTCCGACTGG AGACCGCCAG TCAAGGCGAG CGTCTGCGGG TGATCGAGCC CGCACCGCTG CCGGATTATG CGATGGGCGG TGGCCGCAAG TCTTTGGCGA TCAAGGGCGC CCTTGCGAGC TTTGTCTTGG GGGTTCTTGC GGCCTTTGCC CTAGACTTGC GCCATCCGGT TCTGCGGTCG AGCGGGCAGA TGAAGCGGGA AACCGGCCTG TCCCCGGTGG TGACGATCCC GGTTCTGACG ACGCGCAAAA AAGGTCTGTT TGCACGCCTC TTTGCGCTGG GCGGACGCGG CACGGAACGG CACGCGCGTA GCTAG
|
Protein sequence | MASIYSIADF IDMLRRRVSL IVVVTFLGCV VSVWLALQKQ PIYSSTEVIQ ITRPKIAGDL ARSTAEGSSA RRIQLIEQQL MARGTILEIV DQLDLFADRP GLLDSEIVPL MRNSVSLMGT AAAREGGSDD GAISVLTITA NMPTREQAQA VAREFSKRTI ALSQNTRLAE ARETLLFFNE KEAALVRDIT ALEEEIANFR HENAVTLPGA IEMRGAEITA INESLLELAR QEIELRKGAE EAEATQREAY ARRVREEFDA QLESLTAQRQ LLVDRRAELE ASLELTPDVD RQLASYERRQ QQMQSELEVI TARRAEAEVG FRLETASQGE RLRVIEPAPL PDYAMGGGRK SLAIKGALAS FVLGVLAAFA LDLRHPVLRS SGQMKRETGL SPVVTIPVLT TRKKGLFARL FALGGRGTER HARS
|
| |