Gene Rmar_2114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmar_2114 
Symbol 
ID8568775 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodothermus marinus DSM 4252 
KingdomBacteria 
Replicon accessionNC_013501 
Strand
Start bp2469143 
End bp2470948 
Gene Length1806 bp 
Protein Length601 aa 
Translation table11 
GC content68% 
IMG OID 
Productpara-aminobenzoate synthase, subunit I 
Protein accessionYP_003291382 
Protein GI268317663 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATCCCC TGCTGCTGCC GGGCACGGTC TGGCTCGACA CCGCCCAGCC GGACGAAGAA 
AACCGCCGGA GTCTGCTGTT CATGCGGCCG GTCCGGGTGC TGCAGGCCGA CACCCCGGAG
CAGGTGCCAG CGCTGCTGCG GGCGCTCGAC GGTGCCGTCG CGGCCGGTTA CTACGTGGCC
GGCTACATGG CCTACGAGGC CGGCTATGCG CTGGCCCCCG TGCCGCTTCA GGTGCCTGAC
GAAACCGGAC CGCTTGCCTG GTTCGGCGTC TATGAAATGC CGCACGTGCT GGCACCGGCC
AGTACAGCCG CGCTGGCAGG GAAGACCGGC GATTATGCCG TGCGGGATCT GCACCTGGCG
CTTTCGCGCG AGGCCTATCG CGAGCGCGTG CAGCACATCC GCGCACTGAT CCGGGAAGGC
GAAGTCTATC AGATCAACTT CACGCTGCCG CTGCGTTTTC GCTTCGAGGG CGATCCGATC
GCGTTTTTCC TGGCCCTTCG GCGCCAGCAA CCGGTTCCCT ACGCCGCCTT TGTCAACACG
GGCGAGCGGC TGGTGCTGAG CCTCTCGCCA GAGCTTTTCT TCCGGCGCAA CGGCGAACAG
ATCTATACGC GTCCGATGAA AGGCACGGCG CGGCGTTCGT CGCTCCCGGA GGAAGATGCC
CGGTTGGCCG AGGCGTTGCG CACCGACGAA AAGAACCGGG CGGAAAACCT GATGATCGTC
GATCTGCTGC GCAACGACCT GTCGGTCTGC TGCGAGCCGG GTTCGGTGGC GGTCTCCGAG
CTGTTTCGCG TCGAAGCCTA TCCGACGGTC TGGCAGATGA CCTCGACGGT AACGGGACGG
CTTCGGTCCG GGGTGGGCTA TGCCGAGCTG TTCCGGGCGC TGTTTCCGTC GGGGTCCGTG
ACCGGCGCGC CCAAGCTCCG GGCCATGCAG CACATTGCAC GGCTGGAACC GGCTCCGCGG
GGGGTCTACT GTGGCGCGAT CGGCTATGCG GCACCGGACG GCGAGGCGGT GTTCAACGTG
GCCATCCGCA CGCTGGAGCT GGCCGGATCG GAAGGGCGTA TGGGCGTGGG CAGCGGGATC
GTGTGGGACT CCGATCCGGA CGCGGAATAC GAGGAGTGCT GGCTGAAGGG GCAGTTTCTG
CGGGCGGCCG CCGAGCCGTT TGCGCTGATC GAAACGATGC GCTGCGAACA GGGGCGCATT
CCGCTGCTGG AACTGCACCT GGAGCGTCTG CGCCGGTCCG CCGCGCATTT CGGGTTTGCG
CTGGACGAAG GGCGGGTGCG GGCCCAGCTG GAGCAGGTGC AGCAGGCGCT GGACCCTGCG
AAGGTGTGGC GGTTGCGTCT GACGCTGGAG GTTTCGGGCC AGACGCAGCT GACCACCGCC
GAGCTCGAAC CGGAGCCGGA TCGACCCTGG CGGCTCTGCG TGGCGCGGGA GCGGCTGGAC
CCTTCCGATC CGCTGCGCTA CCACAAGACG ACGCGCCGCG CGCACTACGA GGCGGCCTAC
CTGCAGGCGC AGGCGGCCGG CTTCGACGAG GTGCTGTTTC TGAACACGCG GGATGAGGTC
TGCGAGGGTT CACGCACCAA CCTGTTCGTG CAGCTCGACG GGCGGCTCTA CACGCCGCCG
GTTTCGTGCG GACTGCTGCC CGGGGTGTAC CGGCAGCACG TGCTGCGCAC GCGTCCGGAT
GTCGAAGAAC GGGTGCTGAC GCTGGCCGAT CTGCGCCGGG CCGAGGCGCT CTACGTATGC
AATGCCGTGC GCGGCTGGCG ACCGGCCGTG CTGGCGGTGC CCGAACCGGT GCTCACAACG
CTCTGA
 
Protein sequence
MHPLLLPGTV WLDTAQPDEE NRRSLLFMRP VRVLQADTPE QVPALLRALD GAVAAGYYVA 
GYMAYEAGYA LAPVPLQVPD ETGPLAWFGV YEMPHVLAPA STAALAGKTG DYAVRDLHLA
LSREAYRERV QHIRALIREG EVYQINFTLP LRFRFEGDPI AFFLALRRQQ PVPYAAFVNT
GERLVLSLSP ELFFRRNGEQ IYTRPMKGTA RRSSLPEEDA RLAEALRTDE KNRAENLMIV
DLLRNDLSVC CEPGSVAVSE LFRVEAYPTV WQMTSTVTGR LRSGVGYAEL FRALFPSGSV
TGAPKLRAMQ HIARLEPAPR GVYCGAIGYA APDGEAVFNV AIRTLELAGS EGRMGVGSGI
VWDSDPDAEY EECWLKGQFL RAAAEPFALI ETMRCEQGRI PLLELHLERL RRSAAHFGFA
LDEGRVRAQL EQVQQALDPA KVWRLRLTLE VSGQTQLTTA ELEPEPDRPW RLCVARERLD
PSDPLRYHKT TRRAHYEAAY LQAQAAGFDE VLFLNTRDEV CEGSRTNLFV QLDGRLYTPP
VSCGLLPGVY RQHVLRTRPD VEERVLTLAD LRRAEALYVC NAVRGWRPAV LAVPEPVLTT
L