Gene RoseRS_4114 details

Gene Information

Locus tag: RoseRS_4114
Symbol:
ID: 5211097
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 5154725
End bp: 5155924
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 62%
IMG OID: 640597702
Product: hypothetical protein
Protein accession: YP_001278408
Protein GI: 148658203
COG category:
COG ID:
TIGRFAM ID:
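
The length fields in this record agree with its coordinates if Start bp and End bp are read as 1-based inclusive positions and the final codon is a stop codon. Both are assumptions, since the record does not state its coordinate convention. A minimal Python sketch of that check, with hypothetical function names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(5154725, 5155924)
print(length, protein_length_aa(length))  # prints: 1200 399
```

Under these assumptions the coordinates reproduce the reported 1200 bp and 399 aa.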


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGCGT CTGCCCTGCT CGCTGGCGCC GCGATGCGCC GGATTACTCC ACAACTCGAT 
GCCCGTCCGG TTTTTCTGGC GGGATTTCAG AATAACCGTC GCGCTACAGC GATTGACACC
GACCTGTATG TGCGCGCGCT TGCGCTGCGC CTCGATGAGC GGATTGCGGT AATTGCCGTG
TGTGACCTGA TCGGTCTCGA CCGCAGCGAT GTGCTCGATG TGCGCACTGC GCTCGATGCG
CGCGGCATCG ATCCGTCCGG TCTGGTTGTC GCCTGCACCC ATACCCACAG CGGACCGGAT
ACGCTGGGAT TGTGGGGACC AGACCGGTAC GTCAGCGGGG TGGATCCGCT CTACCTGGCA
GCGGTCAAAC AGGCAATCGT CGATGCGGCA ATAGAAGCGC TGACATTCTG CTGCCCGGCG
CGCATGCGCT GCGCAATGAC CCGTCTGCCG GGATATATCG CCAACTTCCG TGATCCGGGC
ATTGTTGATG ATGACGTGGC GGCGCTCCAG TTTGTGAAAC TGGATGGCGA AGTGATCGCC
ACTCTGCTGA ACCTGGCGTG CCATCCAGAA GTGCTGGACG GCGACAGCAC GCTGATCTCG
GCGGACTATG CCGGGTATGC GTGTCGAGAA GTGGAAACGC GGGTCGGCGG AGTGGCGTTG
CATGTTTCTG GCGCGCTGGG CGGAATGCTA TCCCCCGACA CGCGCGACCG CACCCCTGCC
TGGGCGGAGC GCATGGGGCG CGCCTATGCC GATGCAGCAC TGGCGGCACT GGAGGCGTCG
GCGGTGATCA ATGCTGATCG CCTGGAAGTG CGGCGCACCG AATTCGACCT GCCGCTGGTC
AATCCGCTGC TGCTCATGGC GCAGCAGATG GGAGTATTGC GGGTGCGCCA ACCGGTGAAC
GGTGCGATTA CAACCTCGTG CACCTTCATC GATCTCGGTG CAGCGCAGAT CATTACCGTT
CCCGGCGAAC TGCTGCCACG GCTGGGGTTC GCAATCAAAG CCGCAATGCC CGGTCCCTGC
AAGATTCTCG TCGGTCTGGC GGACGATGAA ATCGGCTACA TCCTGCCCGA TGACGAATTC
GTGCCCCCCG CCGATTACCT GAACCCTGGC AGGCAGTATG AAGAGAGCAT GTCAGTCGGA
CCGACCACTG GCTCACGCAT CCTGGCAGCG GCGCGGGAGT TGATCGGAGA TCATCCGTGA
 
Protein sequence
MNASALLAGA AMRRITPQLD ARPVFLAGFQ NNRRATAIDT DLYVRALALR LDERIAVIAV 
CDLIGLDRSD VLDVRTALDA RGIDPSGLVV ACTHTHSGPD TLGLWGPDRY VSGVDPLYLA
AVKQAIVDAA IEALTFCCPA RMRCAMTRLP GYIANFRDPG IVDDDVAALQ FVKLDGEVIA
TLLNLACHPE VLDGDSTLIS ADYAGYACRE VETRVGGVAL HVSGALGGML SPDTRDRTPA
WAERMGRAYA DAALAALEAS AVINADRLEV RRTEFDLPLV NPLLLMAQQM GVLRVRQPVN
GAITTSCTFI DLGAAQIITV PGELLPRLGF AIKAAMPGPC KILVGLADDE IGYILPDDEF
VPPADYLNPG RQYEESMSVG PTTGSRILAA ARELIGDHP
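
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes that the spaces grouping the listing into blocks of ten are cosmetic, and that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as starts). The names `clean` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI's TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAATGCGT CTGCCCTGCT CGCTGGCGCC"))  # prints: MNASALLAGA
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein listing extends the same check to the whole record.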