Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3712 |
Symbol | |
ID | 4075419 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 770332 |
End bp | 771564 |
Gene Length | 1233 bp |
Protein Length | 410 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638005232 |
Product | hypothetical protein |
Protein accession | YP_611941 |
Protein GI | 99078683 |
COG category | [S] Function unknown |
COG ID | [COG5441] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0523031 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGGTG AAAAGACGAT CCTTGTGGCC GGGACCTGGG ATACCAAGGA TGATGAGCTG TCTTATTTAT CGGAAGTGAT CCGGGGGCAG GGCGGTCAGG TGCTCAGCAT GGATGTGAGT GTGTTGGGCG AGCCCAAACT GCCCACGGAT GTCTCAAAAC ACGACGTTGC CGAGGCGGCG GGCAGTTCCA TTCAGAGGGC CATCGACAGC GGAGATGAAA ATACCGCGAT GCAAATCATG GGGGCGGGCT CGGCCAGGCT TGCGCTGGAT CTGTGGCGCG CGGGGCGCAT CCATGGCGTG ATCGTGCTCG GGGGCACCAT GGGCACCGAT CTTGCGCTCG ACCTCTGTGC TGCGTTGCCT TTGGGGGTGC CCAAATATGT CGTCTCGACC GTGGCATTCT CGCCGTTGCT GCCACCGGAG CGCATCCCGG CGGATCTGCA GATGATCCTT TGGGCCGGGG GGCTCTATGG ATTGAACGAC ATCTGCAAAG CATCGCTCAG TCAGGCTGCG GGTGCCGTTC TGGGCGCCGC GCGCGCGGTG GAGGCGCCCA GTTTTGAGCG TCCGATGGTG GGCATGACCT CCTTTGGAAA GACGGTGCTG CGCTACATGG TGACGCTTAA ACCAGAGCTT GAGAAGCGCG GATTTGATGT GGCGGTCTTT CATGCCACCG GCATGGGCGG GCGCGCCTTT GAGAGCCTTG CGGGGGAGGG CGCTTTTGCG GCGGTGATGG ATTTTGCCCC TCAAGAAGTG AGCAATCATC TCTTTGGCGG CTTGTCGGCG GGCGAGGGGC GCATGACACA CGCCGGGCAT GCGGGTGTCC CGCAACTGAT TGCGCCGGGA TGCTATGACC TTGTGGATTT TGTCGGCTGG CAGGGTGCGC CGGAGCAACT GCGCGGACGG GAGTGCCACG CCCATAACCG CTTGCTGACG TCGGCCATGC TTGATGCGCG AGAACGACAG CGCGTCGCGC AAGAGATGTG CAACAAGCTT GCCCGGGCCT CAGCACCAGT CACGGTGTTC TTGCCCCGCG CGGGCTGCAA CGAATGGGAC CGCGCCGGCG GCGATCTGCA TGATGCGGAA GGGCTTCGGG CCTTTTGCGA TGAGATGCGT CGCGGAGTTC CGGAGAACGC GCAACTGCAG GAGCTCGACT GCCACATCAA TGACGCCGAA TTCACCAATG CGGTGCTGGC ACAGTTTGAT GCCTGGATCA AAGAGGGCGT GATCGTGCGC TGA
|
Protein sequence | MNGEKTILVA GTWDTKDDEL SYLSEVIRGQ GGQVLSMDVS VLGEPKLPTD VSKHDVAEAA GSSIQRAIDS GDENTAMQIM GAGSARLALD LWRAGRIHGV IVLGGTMGTD LALDLCAALP LGVPKYVVST VAFSPLLPPE RIPADLQMIL WAGGLYGLND ICKASLSQAA GAVLGAARAV EAPSFERPMV GMTSFGKTVL RYMVTLKPEL EKRGFDVAVF HATGMGGRAF ESLAGEGAFA AVMDFAPQEV SNHLFGGLSA GEGRMTHAGH AGVPQLIAPG CYDLVDFVGW QGAPEQLRGR ECHAHNRLLT SAMLDARERQ RVAQEMCNKL ARASAPVTVF LPRAGCNEWD RAGGDLHDAE GLRAFCDEMR RGVPENAQLQ ELDCHINDAE FTNAVLAQFD AWIKEGVIVR
|
| |