Gene Rmar_2032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmar_2032 
Symbol 
ID8568689 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodothermus marinus DSM 4252 
KingdomBacteria 
Replicon accessionNC_013501 
Strand
Start bp2362364 
End bp2365444 
Gene Length3081 bp 
Protein Length1026 aa 
Translation table11 
GC content66% 
IMG OID 
ProductAminopeptidase N-like protein 
Protein accessionYP_003291301 
Protein GI268317582 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCACCCG CCGTACGGAT CGCATCGGTG CTCCTGCTCT GGCTGCTTTC TGCCGCTTTC 
GCCGCGGCCC AGACCTCGCC TGACTGGCAA CAGCGCGTGC GCTACGAAAT GGACATTCGC
CTCGACGCCG CGCAGCATCG CATGCGCGGC CGTTCGCGCA TCGTCTACTA CAACCATTCG
CCCGACACGC TCCGTCACGT CTTTTTTCAC CTGTACTTCA ACGCCTTTCA CCCGCAGTCC
ATGATGGCCG AGCGCAACCG GCATCTGCCC GACCCCGACC GGCGGGTGGT TCCGAAAATC
TGGCGACTCG GTCCCGACGA ACAGGGCTTT CACCGCATCA CCTACCTGGC GCAGGAAGGC
AAGCCGCTCA CGTTTCGCAT CACCGACACG GTGCTGAAGG CCGAGCTGGT CCGTCCGCTG
GCACCGGGCG ACTCGACCGT CTTCGAGATC CGCTTTCACT CGCAGGTGCC GCTCCAGACA
CGTCGAAGCG GCCGCGACAA TGCCGAAGGG ATCGACTTTT CGATGAGCCA GTGGTATCCG
AAGATCGCCG CCTACGACGG CCGCGGCTGG CATCCCGATC CGTACATCGG CCGGGAGTTC
TACGGCGTCT TCGGCACGTT CGACGTACGC ATCACGCTGC CCGCCTGCTA CACGATCGGG
GCCACGGGTG TATTGCTCAA TGCGGACGAA GTCGGGCATG GCTACGACCG CATCAGTAGC
CGGGGATGGA CCCGCTTCGA TCCACGCGAA GCGGGCCTGC ACTGCACACC CGGCGACTCG
CTCACCTGGC ACTTCCGCGC CGAGCACGTG CACGACTTCG CCTGGGCGGC CGATCCGGAC
TACATCCACG AAGCCTGGCA CGACGACAGC CTGGGCGTCA CCTATCACCT GCTCTTTCAG
CCCGACGTAG CCGAACGCTG GCAACCCATG CGCCAATGGG TGCCCTGGCT CATTCGCTAC
TTCAGCCGAC GCATCGGGCG CTACCCCTAC CCGCAGTTCA CCGTCGCGCA GGCGGGCGAC
GGCGGCATGG AATACCCGAT GATCAACTTC ATCACGGGAC GTCGCAGTCC GTTTTCGCTG
CTGGGCGTGA CGGCCCACGA AGCGGCGCAC GAGTGGTTCT ACGGCGTGCT GGCCTCCAAC
GAGAACGCCT ACGCCTGGAT GGACGAAGGC TTCACGAGCT GGGCCACCAC CGAGGCCGTC
GCGCACCTGC TGGGTCAGGC GCCCGACCAT CGGGACGCCG CGCTCAGCGT CGTGCGCCTG
CAACAGATGG GCCTGTTCGA GCGCCTGAAC ACGCCCGCCG ACTGGTTCGC TTCCAACGTG
GCCTACAGCG TGGCCGCCTA CCCCGGCGGC GAAATGCTGC TCGATCTGCT GGGCTATGTG
ATCTCCGATT CGCTGCGCGA CGTCTTCCTG CAGGTCTACT TCCGCACTTA CGGCCTGCGC
CATCCCAATC CTTACGACGT CGAAAAAGTC GCCGAGCAGG TCAGCGGCCT GCGCCTGGAC
TGGTTCTTCG AGCAGCTGAC CAACAGCACC TACACCTGCG ACGACGCGCT GGAAGTCGCC
TCACAGGAGC GCACGCCGGA AGGCTGGCGC GTAACGATCC GGCTTCATCG CCGGGGATCC
ATGTTTCTGC CCGTCGATCT CCGGCTGACG CTGGCCGACG GGAGCACGCA GTGGGTGCAT
GTGCCGCTCG GCCTGGCCCA GGGGCACAAG CCGGTACCGC CCGACTGGAT CGTGGCCGAG
CCGTGGCTGT GGACGTTCCC CCGCTACACG CTAACGCTCA CGCTACCGGC CCGCGTCGTG
CGGGCCGAGC TGGACCCGCT TCAGCGTACG CCCGATCACA ACCGCCTGAA CAACACCTGG
CCGTTTCCGC TGCAGCTTTC CTTCCTGCAG CCGCTTTCGT TCGATCCGGC CGCCTACCGG
GCCACGTGGC GCCCGCTGGC CGCTTACGCC TACGACTTCG GTCCCGGCAT CGGACTGCAA
CTGCGCGGCC GCTACTTCTT CGACCGCCAC GAAATGCTGC TGACGCTGAA GCTCTGGCCC
GAAGTGCTGC TGAGCAACGG CCGACGACCG GAGCGCCCCT GGGTGCGCGA CCGCAACGCC
TGGTGGGCGG GCATCGACTA CACGCTGCGC TACAGCGATC GGCTGGCCCC CCGCACCCGC
TGGCACCTGC AGCTCGAAAA GCACCTCGGC GTACTGGAAA ACCGCATCGG CCTCACGCAC
ACGCTGGGCC GCTGGGCGGC TCTGGGCCAC GACTACGGAA CGGTCACGCT GGAGCTGACC
CACCAGTACA CGCCGGGCTT TCGCACTTTC TCATTCGAAA ACGTCCCGGT GTGGACACCG
GGCATGCATC TGGTGTGGAC CGGGCTGCGT TATCGCATCG AACGTCCCTT CGGCCAGCTG
CATGCGCTGC TGGAAACCGG TGAAGGTGAA AACGGCAAAG GAGCAACCCG AGGAGTGCTC
GACCTGCGCT GGCATTTCCT GCGCCGTCCC ACCTTGCGGG CGACACTGCA TGGCCAGGTA
GGCTGGGGCG ATGGCCAGCT TCCCTTCAGC CGCGTCTTCC GGCTGGGAAG TGCCCCGGTC
GAAGCGGCCT GGCGCCATGC GGGCTTCCGG AGCGTGGCCG CGCTGTTTGC CGACGCCCGC
AAGTCGCTCC ACCTGATCCC GCTTGACGCT CCAGGTCCCG TGGCCTACTG GAACCCTGAT
GGGCGCGACC CGCTGCGCGT GCAGGGCAAT GTGCTGGTAG CCGCCAGTCT GCAACTGCAA
TGGACGCCTT TCCGGGCGCG TCTGCTGCGC CCGCTCCAGA TGGAAGGCTT CTTCGGAATC
GGCCAGACCT GGTTCACCGA GCCCACACTG GCCAGCCGCT ACACGTTCGG CTGGGATCGT
TTTCTGGCCG ACGCAGGCGT GGGCCTCGGC TATAACGTAA GCGACCTGCC CGGCCTGCGG
CGCTGGACGG CCCTGTCGGA ACTGTTGCAG GACCTGCGGC TGCGCCTGCG GGTGCCGCTG
TGGGTCAGCG ATCCGGACCT GATCGGCGAA CGCGACGCGC TGCGGTTCCG CTGGATGCTG
GGGATTGTGG TCGGGCCTTG A
 
Protein sequence
MPPAVRIASV LLLWLLSAAF AAAQTSPDWQ QRVRYEMDIR LDAAQHRMRG RSRIVYYNHS 
PDTLRHVFFH LYFNAFHPQS MMAERNRHLP DPDRRVVPKI WRLGPDEQGF HRITYLAQEG
KPLTFRITDT VLKAELVRPL APGDSTVFEI RFHSQVPLQT RRSGRDNAEG IDFSMSQWYP
KIAAYDGRGW HPDPYIGREF YGVFGTFDVR ITLPACYTIG ATGVLLNADE VGHGYDRISS
RGWTRFDPRE AGLHCTPGDS LTWHFRAEHV HDFAWAADPD YIHEAWHDDS LGVTYHLLFQ
PDVAERWQPM RQWVPWLIRY FSRRIGRYPY PQFTVAQAGD GGMEYPMINF ITGRRSPFSL
LGVTAHEAAH EWFYGVLASN ENAYAWMDEG FTSWATTEAV AHLLGQAPDH RDAALSVVRL
QQMGLFERLN TPADWFASNV AYSVAAYPGG EMLLDLLGYV ISDSLRDVFL QVYFRTYGLR
HPNPYDVEKV AEQVSGLRLD WFFEQLTNST YTCDDALEVA SQERTPEGWR VTIRLHRRGS
MFLPVDLRLT LADGSTQWVH VPLGLAQGHK PVPPDWIVAE PWLWTFPRYT LTLTLPARVV
RAELDPLQRT PDHNRLNNTW PFPLQLSFLQ PLSFDPAAYR ATWRPLAAYA YDFGPGIGLQ
LRGRYFFDRH EMLLTLKLWP EVLLSNGRRP ERPWVRDRNA WWAGIDYTLR YSDRLAPRTR
WHLQLEKHLG VLENRIGLTH TLGRWAALGH DYGTVTLELT HQYTPGFRTF SFENVPVWTP
GMHLVWTGLR YRIERPFGQL HALLETGEGE NGKGATRGVL DLRWHFLRRP TLRATLHGQV
GWGDGQLPFS RVFRLGSAPV EAAWRHAGFR SVAALFADAR KSLHLIPLDA PGPVAYWNPD
GRDPLRVQGN VLVAASLQLQ WTPFRARLLR PLQMEGFFGI GQTWFTEPTL ASRYTFGWDR
FLADAGVGLG YNVSDLPGLR RWTALSELLQ DLRLRLRVPL WVSDPDLIGE RDALRFRWML
GIVVGP