Gene Information |
Locus tag | TM1040_3478 |
Symbol | |
ID | 4075112 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 502467 |
End bp | 503726 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638004987 |
Product | hypothetical protein |
Protein accession | YP_611712 |
Protein GI | 99078454 |
COG category | [S] Function unknown |
COG ID | [COG3748] Predicted membrane protein |
TIGRFAM ID | |
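The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the reported 1260 bp) and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 502467
end_bp = 503726
gene_length_bp = 1260
protein_length_aa = 419


def span_length(start, end):
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 503726 - 502467 + 1 gives 1260 bp, and 1260 / 3 - 1 gives 419 residues, matching the table.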
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.300585 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCAAGAGC TTGTCATCAT GTGGGACTGG CTGGGGTTTG CCGTCCGCTG GCTACATGTC ATCACCGCGA TCGCCTGGAT CGGGTCGTCC TTTTATTTCG TGGCGCTGGA TCTGGGGCTG CGCAAGGTGC CGCATCTTCC GGTGGGCGCG CATGGTGAAG AGTGGCAGGT GCACGGCGGT GGCTTTTACC ACATCCAGAA ATATCTGGTC GCCCCGGCCA ATATGCCCGA CCACCTGATC TGGTTCAAAT GGGAGAGCTA CGCCACCTGG CTCTCGGGGG CTGCGCTCCT GATGATCGTC TACTGGGCGG GCGGCGAGCT CTATCTCATT GACGCAAACA AGGCCGACCT GGCGCTGTGG CAGGGGATCC TGATTTCCGG CGCATCGCTG AGCGTGGGCT GGCTGGTCTA TGACTTTTTG TGCAAATCCC CGCTTGGCGA AAAGCCGACC ATGCTGATGG TGCTGTTGTT CGTGCTGCTG GTAGCCATGG GTTATGGTTA CAACCAGATC TTTACCGGCC GGGCGGTGAT GCTGCATCTG GGGGCCTTCA CGGCGACCAT CATGACGGCG AATGTGTTCT TTATCATCAT GCCGAACCAG CGCATCGTGG TAAAAGACCT GCAAGAAGGG CGCACGCCGG ATGCAAAATA CGGCAAGATC GCCAAGCTGC GCTCGACGCA CAACAACTAC CTGACCTTGC CGGTGATCTT CCTGATGCTC TCCAACCACT ATCCGCTGGC CTTTGCCACC GAGTACAACT GGCTGATTGC GGCGCTTGTG TTCCTGATGG GGGTGACGAT CCGTCACTAC TTCAACACGC GGCACGCGGG CGCGGGCAAT CCGACCTGGA CCTGGCCCGT GACGATCCTG CTGTTCATTG CCATCATGGC ATTGAGCCAA GCGCCGCTGG TGCAGGACAC CTATGAGGAG TCCGAGGCGC GTACACTGAC CCAAACCGAG CTGGCTTTTG CTGGCTCGGA GCATTTCGAG GACGTGATGA ATGTGGTGCC GGGACGTTGC GCCATGTGCC ACGCGCGCGA GCCTTATTAC GACGGCATCC GCCGCGCGCC CAAGAATGTG TTGCTCGAAA CACCGGCAGA TGTCGCCAAA TACGCCCGGC AGATCTATCT TCAGGCCGGG GCGACCCATG CAATGCCGCC TGCAAATGTG ACCTATATGG AAGAGGACGA GCGCGCGCTC ATCCGTCAAT GGTATGCAGA TGGCGCGGCA GAGCTGCCCT TCTATCTGGC GGGCAACTGA
Protein sequence | MQELVIMWDW LGFAVRWLHV ITAIAWIGSS FYFVALDLGL RKVPHLPVGA HGEEWQVHGG GFYHIQKYLV APANMPDHLI WFKWESYATW LSGAALLMIV YWAGGELYLI DANKADLALW QGILISGASL SVGWLVYDFL CKSPLGEKPT MLMVLLFVLL VAMGYGYNQI FTGRAVMLHL GAFTATIMTA NVFFIIMPNQ RIVVKDLQEG RTPDAKYGKI AKLRSTHNNY LTLPVIFLML SNHYPLAFAT EYNWLIAALV FLMGVTIRHY FNTRHAGAGN PTWTWPVTIL LFIAIMALSQ APLVQDTYEE SEARTLTQTE LAFAGSEHFE DVMNVVPGRC AMCHAREPYY DGIRRAPKNV LLETPADVAK YARQIYLQAG ATHAMPPANV TYMEEDERAL IRQWYADGAA ELPFYLAGN
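The relationship between the gene sequence and the protein sequence can be reproduced with a small translation routine. This is a minimal sketch, not the pipeline that produced the record: it assumes the sequence is given 5' to 3' on the coding strand, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG). The names `translate` and `gc_fraction` are hypothetical helpers.

```python
from itertools import product

# Codon-to-residue mapping shared by the standard code and table 11,
# listed in the conventional T, C, A, G order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna):
    """Remove the spaces used to group bases in blocks of ten; uppercase."""
    return "".join(dna.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence, exactly as printed in the record.
print(translate("ATGCAAGAGC TTGTCATCAT GTGGGACTGG"))  # MQELVIMWDW
```

Passing the full gene sequence to `translate` should yield the protein sequence listed above, and `gc_fraction` on the same input can be compared with the reported GC content.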