Gene Information |
Locus tag | Rmar_1480 |
Symbol | |
ID | 8568131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodothermus marinus DSM 4252 |
Kingdom | Bacteria |
Replicon accession | NC_013501 |
Strand | - |
Start bp | 1727316 |
End bp | 1728794 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | |
Product | Arabinogalactan endo-1,4-beta-galactosidase |
Protein accession | YP_003290755 |
Protein GI | 268317036 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
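The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS.

    Assumes the CDS length is a multiple of three and that the
    final codon is a stop codon, which encodes no amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len_bp // 3 - 1


# Values taken from the record for Rmar_1480.
length = gene_length(1727316, 1728794)
print(length)                  # 1479
print(protein_length(length))  # 492
```

Under these assumptions the computed values agree with the listed Gene Length (1479 bp) and Protein Length (492 aa).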
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCGGC CTCGTCGTAC GTGGCTGGCG CTCGGGCTGG TGTCGGTGCT GCTTGTAGGG CACGGCACCG CCCGGGCGCA GGCCTTTTAT TTCGGACACG ACCTCTCCTA CGCGAACCAG ATGGAGGATT GCGGGGCCAC GTTCAAAGAA GGGGGCGTCG AAAAAGACAT CTATCGGATT TTCTCCGATC ACGGTACGAA CCTGGTCCGG CTCCGCCTCT GGGTCGACCC CACCTGGCAG CGGCAGTTGC AGCAGCCGCC GGGCGTCAAG CCACAGTACA GCGATCTGGC CGACGTGCGG GAGGCCATCG CCCGCGCGAA AGCGGCAGGT ATGCAGGTGC TGCTGGACTT TCACTACTCC GACTTCTGGG CCGATCCCGG CCGACAGGTC ATTCCAGCCC GCTGGCGCTC GGTAGCCCAT AACCTGGAGG CGCTGAAAGA CTCGGTCTAT GCCTACACCT ACGCGGTGCT GACCACGCTG GACCGGGAAG GACTCATGCC CGAGATCGTC CAGGTCGGCA ACGAAAACAA CAGCGGAATC CTGATTCACG CCGACATGGA CGAAAATTAC AACGGGATTA ATCCCGTTGA CTGGCGCTGG TCGCGCCACG CGCAACTGTT CAATGCGGCC ATTCGGGCCG TGCGCGATGC CGGGGCCGCG GGCTCGGTGA TGCCAAAGAT CGCACTCCAC TACGCCGGCC TGAACGGCAG CGTGCAGCAC TTCCAGCGGC TGATCAGTCT GGGCGTGACC GACTTCGACA TCATGGGGCT TTCGTACTAT TACGCCTTCC ACGGCGGGAG CATTGCCGAG CTGGGCGCTA CGATTCGGGA GCTGGTCGAC CGCTTCCCGG ATTACCAGAT CATGGTGCTG GAAACCGCCT ACCCCTGGAC CTCGCGCAAC TATGACGCAC TGGGCAATCT GCTCAACACG CAGGATCCCG ACTACTATCC GTTTTCGCCC GAGATGCAGC GCACCTATAT GGTGGACCTG ACGCGCACGG TCATCCAGGC GGGCGGGAAG GGAGTAGTGT TCTGGGAACC GGACTGGGTC TCGACGCCGT GCCGCACGCC CTGGGGGCAG GGCTCTTCCT TCGAGCACGT GGCCTTCTTC GAACCGGGCA CCTATAACCT GATTGCCAAC GGTGGGATTG GCTGGACGCA CCGCGAGTGG TACGCCGATC TGCTGACAAC TGGCGAGGCC ATCACGCCCG AGACGCCGGT GCTGCTGGAA GCCGTCTATC CCGTGCCCTT TCGAAATCGC CTCACCGTAA CGTATCGGCT GGTCCGACCG CAGCAGGTTA CCGTACAGGT GCTCGATGTA CTGGGACGCA TTGTAGCCAC GTTGAGTACG GGGCGGCAAC CGGCCGGCCT GCACCGCCTT TTCTGGCAAC CCGCAGCGCT CCCGACCGGA CGCTACCTGC TCCGCGTGCA GACGTCGGAC CGTACGGAAA GCCGCTGGCT GTTTTACATC CCCGAATAG
|
Protein sequence | MQRPRRTWLA LGLVSVLLVG HGTARAQAFY FGHDLSYANQ MEDCGATFKE GGVEKDIYRI FSDHGTNLVR LRLWVDPTWQ RQLQQPPGVK PQYSDLADVR EAIARAKAAG MQVLLDFHYS DFWADPGRQV IPARWRSVAH NLEALKDSVY AYTYAVLTTL DREGLMPEIV QVGNENNSGI LIHADMDENY NGINPVDWRW SRHAQLFNAA IRAVRDAGAA GSVMPKIALH YAGLNGSVQH FQRLISLGVT DFDIMGLSYY YAFHGGSIAE LGATIRELVD RFPDYQIMVL ETAYPWTSRN YDALGNLLNT QDPDYYPFSP EMQRTYMVDL TRTVIQAGGK GVVFWEPDWV STPCRTPWGQ GSSFEHVAFF EPGTYNLIAN GGIGWTHREW YADLLTTGEA ITPETPVLLE AVYPVPFRNR LTVTYRLVRP QQVTVQVLDV LGRIVATLST GRQPAGLHRL FWQPAALPTG RYLLRVQTSD RTESRWLFYI PE
|
| |
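The gene sequence as displayed begins with ATG and ends with TAG, so it appears to be given in coding orientation even though the gene lies on the minus strand. As a rough illustration of how the protein sequence follows from the gene sequence, the sketch below translates short excerpts of the record. It is a minimal sketch, not the database's own method: the helper name `translate` is hypothetical, and the codon table is built from the standard amino acid assignments, which translation table 11 shares (table 11 differs from the standard code in its permitted start codons, which this sketch does not model).

```python
# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in the record.
print(translate("ATGCAGCGGC CTCGTCGTAC GTGGCTGGCG"))  # MQRPRRTWLA
print(translate("ATCCCCGAATAG"))                      # IPE
```

The two outputs match the first ten residues (MQRPRRTWLA) and the last three residues (IPE) of the listed protein sequence.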