Gene Rmar_1042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmar_1042 
Symbol 
ID8567683 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodothermus marinus DSM 4252 
KingdomBacteria 
Replicon accessionNC_013501 
Strand
Start bp1192020 
End bp1193510 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content68% 
IMG OID 
Productanthranilate synthase component I 
Protein accessionYP_003290322 
Protein GI268316603 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGTTCG AGCGCTTTCA GGCATTGATC GACACGCACC GGACGGCCGG CCACACGCAC 
CTGTTCGTGC CGGTCTTCCG GCGGCTGGGC GCCGACCTGC TCACGCCCGT GTCGGCCTTT
CTGAAACTGC GCGGGCATGA ACCCGGCGCT TTTCTGCTGG AAAGCGTCGA AGGCGGCGAG
AAGCTCGGCC GCTACTCGTT CCTGGGCGTG CGACCCTACC TGACCGTGGA GGTGCGCGAC
GGACAGGTGA CGCTGCAACG GGCGAATGCC GCGCCCGAGA CAAGTCCGGA CGACTTCTTC
GCCACCATGC GGCGGTTGCT CCGCCGGTAT CATCCGGTCC AGGTGCCCGA GCTGCCGCGC
TTTACGGGCG GGGCCGTGGG CTACCTGGGC TACGATATGA TCCGGCAACT GGAGCGCCTG
CCTGCCCCGC CGCCCGACGA CCTGGGCCTG CCCGACGCCC GCTGGAACTT CTACGACACG
GTGGTGGCCT TCGACCACGT GCGGCACCAG CTCGTGCTCA TGGCGGGCGT TTTCGTGGAG
CCCGAGACCG ACCTGCGTGC GGCCTACGAC GAGGCCGTCG CCCGTCTGGA CGCGCTCACC
GACACCCTTT CGCACGCGCC GCTGGAGGCC CCCGAGCCGG TCTCGCTCCC CGAGACGCCG
CTCACGTCGA ACTTCACGCG GGAGGATTTC TGCCGGGCGG TCTGCCGCGC CAAAGACTAC
ATCTACGAAG GCGATATTTT CCAGGTGGTG CTCTCCCAGC GGTTTGCCAC GCCGTACGCG
GGCGACCGCT TCAATCTGTA CCGGGCGCTC CGCCAGGTCA ATCCGTCGCC CTACCTGTTC
TATATCGATT TCGGCGACCT GGCGCTGATC GGTTCGTCGC CCGAGGTGCT CGTGCGCGTC
GAGCACGGCC GGGCCGAGGT GCTGCCCATT GCGGGCACGC GGCCGCGCGG CCGCACGCCC
GAAGAGGATC GGGCGCTGGA GGCTGAGCTG GAAGCCGACC CCAAGGAACA GGCCGAGCAC
CTGATGCTCG TCGATCTGGG CCGCAACGAC CTGGGGCGTG TATGCCGCTT CGGGACGGTT
CAGGTGGAGC GTTTCGCGTT CGTCGTGCGC TACTCGCACG TGATGCACCT GGTGTCGCTC
GTGGCCGGCG AACTCGATCC GCGCTACGAC GCGCTCGATG CGCTGGCGGC GTGCTTTCCG
GCCGGCACCG TCAGCGGCGC GCCCAAGGTG CGGGCCATGG AGATCATCGA CGAGCTGGAG
CCCACGCGGC GCGGCGTCTA TGCCGGGGCG GTGGGCTACA TGGATTTTTC GGGCAATCTG
GACACCTGCA TTGCCATCCG CACCATGGTG GTCCGCAACG GCACGATCTA CGTCCAGGCC
GGGGCGGGCA TCGTGGCCGA CAGCGACCCC GAACGCGAGT ACGAGGAAAC CGTTAACAAG
GCCCGGGCGC TGGTCGAGGC CATGCGCGTC GCCGCGTCCG GTTTGCTTTA A
 
Protein sequence
MTFERFQALI DTHRTAGHTH LFVPVFRRLG ADLLTPVSAF LKLRGHEPGA FLLESVEGGE 
KLGRYSFLGV RPYLTVEVRD GQVTLQRANA APETSPDDFF ATMRRLLRRY HPVQVPELPR
FTGGAVGYLG YDMIRQLERL PAPPPDDLGL PDARWNFYDT VVAFDHVRHQ LVLMAGVFVE
PETDLRAAYD EAVARLDALT DTLSHAPLEA PEPVSLPETP LTSNFTREDF CRAVCRAKDY
IYEGDIFQVV LSQRFATPYA GDRFNLYRAL RQVNPSPYLF YIDFGDLALI GSSPEVLVRV
EHGRAEVLPI AGTRPRGRTP EEDRALEAEL EADPKEQAEH LMLVDLGRND LGRVCRFGTV
QVERFAFVVR YSHVMHLVSL VAGELDPRYD ALDALAACFP AGTVSGAPKV RAMEIIDELE
PTRRGVYAGA VGYMDFSGNL DTCIAIRTMV VRNGTIYVQA GAGIVADSDP EREYEETVNK
ARALVEAMRV AASGLL