Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1453 |
Symbol | |
ID | 4077750 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 1551847 |
End bp | 1553010 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 638006764 |
Product | hypothetical protein |
Protein accession | YP_613448 |
Protein GI | 99081294 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01445] intein N-terminal splicing region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.445258 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCAAA CGCGCTTCTT TAAAATGGCG CAACGAGCAG ATTCAATTTT CGACGGAGGC ATCACAGGCA TGCCCACCCA TGACTTTTTT GTCTACACGC CGGATGCGCT GACGTTTTCC GGCGGTACGA TCCGCGTAAA CTCTGGTTTT GACCCGCTTT CTGATCGGCG CGTTGTGTCT CTCACAGATG ACGATTCCAC GCTGGATGGG GATTTCACCC GAAATGAGCG CGGTATCGAC AGCAACCAGC GGGCAACGGT TTTTGAAAGC GATGGCAGCA CATTGGCGCG TATCGGCGGA GATTTTGTGC AAAACGACCA AGTCTATGCA GAAAATCAAT ATGTGCTGAC CGGCGATGAT GGCAGCGAAA TTACCGTTTA TGCGCTGGAA AGCGGCGGCA CACTTATTGG CTATCTTCCA ACTGCGCCGC TTTTACGGGA TGTGAATTAC AGCTACCGGA CCCACAATGT CATCAACGAT GACGAGCTGA CGGGCTCCTA TCAATATTGG TATCGTCAAT ACCTCGGCGA GGACGCGACC GATGCCGGAT CCTATGATGA CATTCAGGGG GCCGTGATCG TGTGTTTTAC GCCCAACGCG CAGGTCACCA CCCCGGAAGG TCCGCGCCGC ATCCGGGATC TTGCCGTGGG CGATCGCGTG CTGACACGGG ACAATGGTTA TCAGACCCTG CGGTGGAAAT ATCACCGCCG CCTGTCACGC GCAGATCTCG ACCGTCAACC GCATCTGGCG CCAATCATCT TTGAGCCAGA CGCACTGGCG CCGGGCTGCC CCAAACGCCG CCTCAAAGTG TCTCCACAGC ACCGGATTCT CATTGAATCC CACATGAGCG GGCTCCTCTT TGCCAGCAAT GCCATCCTCG CACCGGCCAA GGGGCTGGTG AACGGCACCA GTGTGCGACA GGATCAAAGT GGCATGCCGG TCACCTATGT GCATCTGATG TTTGATCGCC ATGAGGTGAT CGAGGCCGAT GGGCTCTATT CCGAAAGCTA CCATCCCGGT GCCTGGGCGC TTGCGGCTGC CGAAGATGCG GTCCGGCGCG AGCTTTTTGA GATATTCCCG GCTCTGGAGG CGGATCTCAA TGCCTATGGC CCTACATGTT ATCCCTCTAT AAATGCTCAA GAAGCACGTC TACTGAAGGC ATGA
|
Protein sequence | MRQTRFFKMA QRADSIFDGG ITGMPTHDFF VYTPDALTFS GGTIRVNSGF DPLSDRRVVS LTDDDSTLDG DFTRNERGID SNQRATVFES DGSTLARIGG DFVQNDQVYA ENQYVLTGDD GSEITVYALE SGGTLIGYLP TAPLLRDVNY SYRTHNVIND DELTGSYQYW YRQYLGEDAT DAGSYDDIQG AVIVCFTPNA QVTTPEGPRR IRDLAVGDRV LTRDNGYQTL RWKYHRRLSR ADLDRQPHLA PIIFEPDALA PGCPKRRLKV SPQHRILIES HMSGLLFASN AILAPAKGLV NGTSVRQDQS GMPVTYVHLM FDRHEVIEAD GLYSESYHPG AWALAAAEDA VRRELFEIFP ALEADLNAYG PTCYPSINAQ EARLLKA
|
| |