Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3843 |
Symbol | |
ID | 4074906 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008042 |
Strand | + |
Start bp | 90406 |
End bp | 92100 |
Gene Length | 1695 bp |
Protein Length | 564 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 638004500 |
Product | hypothetical protein |
Protein accession | YP_611235 |
Protein GI | 99077976 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.101575 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAATG TTCTTCCGTT TGGAAAAAAA ACGCATTCCT CGCTTAGTGA GAACACTCAA GACGTATTAG ACCTGAGCAC ACTGACGGAT GGCGCGGCGG ATTTCTCCAC CGGAATGGTT GACCGGTCAT CAGGGGGCCT CTCACACCTC CCTAACCAAA CTGCAGTCAC ACCCGATTGG GCCAATCAAG AAATTGCTAG CTTCATGCGA GCACATCGGT TGCTTGCCAT TGCGGGTCTC TCCTTGGAAA CAGAGCGCGG AGTGACCGAC GAAGGTGACC CTTGGTTTGT TTTCGTAAAT CAAGATGGAG ACGTATTCGC GCATTTCGCG CGCATCGGCG AAGTCTACAT CCTCGACAGT ATTATTCAGC AGGAAATTAC AAAAGCCAGA AGCTTGGATG AACTGATTTC AGCCTTTGCA GAAACCACTA CACCGGTGGA CAGTGTAGAC CACGCTTCAA CATCTGCCAC AGTTGTTCCC TTCATCGCCT TGCGAGAATC AAAAGTACGT CTACACCCAG GCGCAACCTT GGCCGCTCTT ATCTGGACAA TCTATATCCA GTCAGGAGAA CTTGCGGTCC CATCATTTAG TGCTGCGCTG GATGTGACTG CAACGTCAAA GGCCGATGAA GTGACCACGC TACCCTCTCA AACCCGGGCC TCCCCCTCGG AAGAGTTGGC TACGAGTACA TCTGAAGATA ATTCAGCAGA TCTCAAAAGT ACTGAAAAGG ATACACACAC TCAAACGCCC TCAAGCGCTC AAGTGGCAGC GACATTTACT ACCGTGCAGA CGGTTGGAAT GGGATTGAGC GCTATAGCCA TCTCGAATGG CATGTACTTT TGGGTTTCAG GTGAGACCTT GCTCGAACAA GCGACACTGG CATCTCAGCT AGTCGCCGAA GTAATCAAAG ATATATCTGA TGAAGAAACT GAGCAGCTTG CCGATCTGTC GGAGCTAGAT AGCGTATTAT CCACAGTTCG CGCAGCGATA GAGGACAGTT CAGAGGCAAT GGCTTTGGCT CAATCGCCAA TACGAGATAT TCCTACTGTC GACTTGGCAT CCATGCCGGC TATCGTTGCT AACGCTGTTG AAAATGGAAA AGTTGGGAAC GCAATCAAAG CTAAAGACGT TCAAGGCGAC GTAGCTTTTT CTGAAGAGAT GGATGGTGAG CTCATCAAGT TCCGGGATAT TCAGGCGAAC CGCCCGGAGG AAATAGAGAC ATCTCAGATT CGCTCTACAC CAGTAGACAT TGAGAAAAAA TATATTTTAC AAGAGCTTGA CGAGGTTGAG CACTTCTCGC TTACGTCCGA CGCTTTTAGC AATATAGCTA CCCAGCTTTC TGACCTACCA TTCTTCACGC AAATGCTCAG CGGCTCATTC CAAGCGGTCG TTGCCAGTGA GGGTTCTCAA ACTGTGCCCG CGTCAAGGCC CGACACCCTC AATGATGCCT CCGAGACGGC ACCACGCTTC TCGATCTTTA ACGATGATGC GCACAATTTT ATTGTATTCT TGATGAGTAA AGGGGATGAG ACCAAACGGT CTGACTACGA TAACGAAGTT GTACTCTTTG ACTTTGATGC AATCGATAGT CAAACAGACG CGATCTATGC TCGTAGCTGG TCCTTCGAAG ATGGATCTGT GATTTCTGCT GTAGGTTTAA AAAGCGACTT TGCTGCTTTT GATCTTGTGA TCTAA
|
Protein sequence | MNNVLPFGKK THSSLSENTQ DVLDLSTLTD GAADFSTGMV DRSSGGLSHL PNQTAVTPDW ANQEIASFMR AHRLLAIAGL SLETERGVTD EGDPWFVFVN QDGDVFAHFA RIGEVYILDS IIQQEITKAR SLDELISAFA ETTTPVDSVD HASTSATVVP FIALRESKVR LHPGATLAAL IWTIYIQSGE LAVPSFSAAL DVTATSKADE VTTLPSQTRA SPSEELATST SEDNSADLKS TEKDTHTQTP SSAQVAATFT TVQTVGMGLS AIAISNGMYF WVSGETLLEQ ATLASQLVAE VIKDISDEET EQLADLSELD SVLSTVRAAI EDSSEAMALA QSPIRDIPTV DLASMPAIVA NAVENGKVGN AIKAKDVQGD VAFSEEMDGE LIKFRDIQAN RPEEIETSQI RSTPVDIEKK YILQELDEVE HFSLTSDAFS NIATQLSDLP FFTQMLSGSF QAVVASEGSQ TVPASRPDTL NDASETAPRF SIFNDDAHNF IVFLMSKGDE TKRSDYDNEV VLFDFDAIDS QTDAIYARSW SFEDGSVISA VGLKSDFAAF DLVI
|
| |