Gene RoseRS_2471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_2471 
Symbol 
ID5209440 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3056663 
End bp3057868 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content62% 
IMG OID640596076 
ProductGCN5-related N-acetyltransferase 
Protein accessionYP_001276798 
Protein GI148656593 
COG category[R] General function prediction only 
COG ID[COG4552] Predicted acetyltransferase involved in intracellular survival and related acetyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACGG TCGATCTCCA GTATCGCGCC GTTGCGGTTG ATGACGATCA GTTCGCCTTC 
AATGCAGCGC TGGCATATAA CATATCGCTC GACGAGGCGC GGCGCAATCT GCGGGCTCAT
GCGCCCGGCG ATCATCGCGG ATTGTACCGC AATGATCGCC TGGTGACGCA GTGTATTCTG
TATCCACTGC GCCTGGCAAA CGGCGCAGGC GGGATGATCG CAGCAGGCGG CATTGGAGCG
GTTGCCACGC CGCCCGAAGA GCGACGACGC GGGTATGTGG AACGTCTGCT CCGCGCGGTG
TGCGACGAAC TGCGCAGCCA GAACGTGCCG CTGAGTATGC TGGCTGCGTT TAAGGAGTCA
TTCTACCGGC GCTACGGATG GGCGACCTTT TATGAGGCAC GGCGGTTGAG TGGACCGCCG
GATCGGTTTG CTCCGTTTTG CGCGCATCGA TCAGGGGAAT GGGTGCGCCT CGACGATAGT
GTGGCGGCGG CTACGGAACT GGACGCAATC TATCGCGGCG CGCTGCGTGG GCGCTTCGGT
CCGCTCGAAC GCGATGCCCT CTGGTGGCAA ACGCGCGTAT TCCGTTCGGG AGACGCTTCC
CCACGCTGGA TCTATGTCTG GCGTGACGAC GCCGGTCACG GACGATCCTA CGTCATCTTT
TCCGTCGAAC GCGGTGAGCA AGGGCGAGTA GTGCGTTGCC GCGAAACGGT GGCGCTCGAC
CCGCAGGCGC GTGCGCAGAT CTTCGCGTTC TTCGCGGCAT TTGCCGATCA GTGCGCCGAG
GTAGTTTTCC ATGCACCCGC CGATGCTCCG GTGCAGATGC TGATGCCCGA TCCGCTGGAA
TGTACCATGG AGCCGGGCAG TATGCTGCGG ATCGTGGATG TCGCGCAGGC GCTGGAAGCA
TACGCCTTCC CACGCGATGT CGCCGGTCGG GTGACGCTGC GCATCGCCGA TGACTGGCTG
GAACACAACA ACGCGGTTTT TCAGTTGGAG ATCGAAGGAG GCGTGGCGCG AACGACCCGT
CTGAATGATG GGATCAATGC CGATGTGCGC TGCGATATAC GAACATTGAC GCACATCTAC
AGTCGCGCCA TCCGTCCACG GACTGCGGCG GCATTTGGTT TGCTCGATAT CCATACACGC
CCGGCGCTGG CGCTGCTCGA ACGTCTGTTT GCAGGGCTGG CGCCGTATGC GTCGGACTGG
TTTTGA
 
Protein sequence
MTTVDLQYRA VAVDDDQFAF NAALAYNISL DEARRNLRAH APGDHRGLYR NDRLVTQCIL 
YPLRLANGAG GMIAAGGIGA VATPPEERRR GYVERLLRAV CDELRSQNVP LSMLAAFKES
FYRRYGWATF YEARRLSGPP DRFAPFCAHR SGEWVRLDDS VAAATELDAI YRGALRGRFG
PLERDALWWQ TRVFRSGDAS PRWIYVWRDD AGHGRSYVIF SVERGEQGRV VRCRETVALD
PQARAQIFAF FAAFADQCAE VVFHAPADAP VQMLMPDPLE CTMEPGSMLR IVDVAQALEA
YAFPRDVAGR VTLRIADDWL EHNNAVFQLE IEGGVARTTR LNDGINADVR CDIRTLTHIY
SRAIRPRTAA AFGLLDIHTR PALALLERLF AGLAPYASDW F