Gene Information |
Locus tag | TM1040_0594 |
Symbol | |
ID | 4078632 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 635522 |
End bp | 636592 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638005891 |
Product | GumN |
Protein accession | YP_612589 |
Protein GI | 99080435 |
COG category | [S] Function unknown |
COG ID | [COG3735] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
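The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the unspliced CDS is a stop codon. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(635522, 636592)
print(length)                     # 1071
print(protein_length_aa(length))  # 356
```

Both results match the Gene Length and Protein Length fields of the record.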
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.241158 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.100531 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCTTTCCC GCGCGCAACG CTCCGGTGAC AGACGGCATG CGCTTGGCCA CTCTGGTGCT GTCCCTGCCA CGCTGCGGCC GTTTCTCTCG TTGATCTGTT TCTGCTTGGT CGCAGTCTGG ATGGCCACGC CTCTTCGGGC CGCCTGCAGC GGCCTAGACC TGCGGGAGAG CGCCGCGCCA GGTATGTGGG CAGAGATCGA GGCGGAGATT GCCCTCACCC CGTTTGCGCG CGGGCTCAGC TGGGTCGCAA CCCGCGGCGA TCGTCGCCTA CATATCATTG GCACCATGCA TCTGAACGAT CCGCGCCACG ATGCGCTGGT TGCGCATATG GCGCCTGCAA TTCAGGCGGC CGATGCGGTC TTGCTCGAGG TCAATCGTGC AGACAAGGCC AGACTCGAGC GCACCCTGGC TGAGACGCCA TCGCTCATCT TTATCACCGA GGGGCCGACC TTGATCGACC GCCTGCCCCC AGAGGAGTGG GACGAGATCG CAGAGCGCGC CGGCGCTGCC GGAATCCCGC CTTTCATGGC GGCCAAGATG CGACCCTGGT TTCTGTCATT GTCAATGTCT GTGCCACTCT GTGCGCGCGC AATTGAGGAT GTCACTGATG GGCTCGACAT GAAATTGATG ACCCTGGCAG AGCGCGCCGG CGTCGAAACC CTATCGCTCG AAGATCCTCT GGGCTTGTTC CAGTTGTTTG ACGCCACGCC AATCGAAGAG CAGATCAAGG AACTGCGAAG CTACATCGCC ATGGCAGGCG TGGGCGCGGA TGATTTCTAC ACGATGGTGG AAAGCTATTT TGACGGGGAG GTTCAGGCCT ACGTCCTTTT GCAGTTGAAG CAATTTCTCA CATCAGAAAG CGACCTCTCG GTAGCAGAGC GTCAGGCGCA GGTGGATGAC CTGCTGGAGG CTCTGCTCTA CCAGCGCAAC CGTGATTGGA TCCCGGTCAT AGAAGCCACT GCGGGCAACC GACTGGTGAT CGCGGTGGGC GCGGCGCATC TCCCCGGAGA GGCCGGGGTG CTGGCCCTTC TCGAAGCAGA GGGTTATCAA ATTGAAGAGG CGCTCCGGTG A
|
Protein sequence | MLSRAQRSGD RRHALGHSGA VPATLRPFLS LICFCLVAVW MATPLRAACS GLDLRESAAP GMWAEIEAEI ALTPFARGLS WVATRGDRRL HIIGTMHLND PRHDALVAHM APAIQAADAV LLEVNRADKA RLERTLAETP SLIFITEGPT LIDRLPPEEW DEIAERAGAA GIPPFMAAKM RPWFLSLSMS VPLCARAIED VTDGLDMKLM TLAERAGVET LSLEDPLGLF QLFDATPIEE QIKELRSYIA MAGVGADDFY TMVESYFDGE VQAYVLLQLK QFLTSESDLS VAERQAQVDD LLEALLYQRN RDWIPVIEAT AGNRLVIAVG AAHLPGEAGV LALLEAEGYQ IEEALR
|
| |
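The sequences above are displayed in space-separated groups of 10 characters. As a rough sketch for working with them, the following Python removes the grouping and computes a GC fraction. The helper names are hypothetical, and the example runs on only the first two groups of the gene sequence, so the printed value is not the gene-wide GC content reported in the table.

```python
def ungroup(seq: str) -> str:
    """Remove the whitespace used to display a sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an ungrouped DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First two display groups of the gene sequence in this record.
prefix = ungroup("TTGCTTTCCC GCGCGCAACG")
print(len(prefix))                    # 20
print(f"{gc_fraction(prefix):.2f}")   # 0.65
```

Passing the full Gene sequence field through the same two helpers gives a value that can be compared with the GC content field.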