Gene RoseRS_1122 details


Gene Information

Locus tag: RoseRS_1122
Symbol:
ID: 5208069
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 1398540
End bp: 1399760
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 64%
IMG OID: 640594735
Product: laminin G
Protein accession: YP_001275479
Protein GI: 148655274
COG category:
COG ID:
TIGRFAM ID:
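
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1398540
END_BP = 1399760
GENE_LENGTH_BP = 1221
PROTEIN_LENGTH_AA = 406


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


print(span_length(START_BP, END_BP))            # 1221
print(expected_protein_length(GENE_LENGTH_BP))  # 406
```

Under these assumptions both derived values agree with the listed gene length (1221 bp) and protein length (406 aa).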


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTCGCT TTCCCACATT CCCGTTCAAC GAACGCTTCC AGCGCGCAGG ACGTGCGCTC 
GTGGCGCTGG CGCTGCTGCT GACCGTTCTG CCAGAATCCA TCCGGGCGCA GGGCGACTTC
TCGCTCCGCT TCTATGGCAC AGGACGCGAT GGCGTTGACC GCGTGATGAT CCCGCTCGAT
GCGCCGCCGC GTCCGGTCGA CGTTGGCGGC GATTTCACCA TCGAATTCTG GCTCAAGGCG
CTTCCCGGCG ATAATGCGGC GTCAGCCTGC TCTCCCGGCG AGGACAACTG GATCTACGGC
AATGTCGTCA TCGACCGCGA TGTCTACTTT GCCGGCGACT ACGGCGACTA CGGCATCTCG
CTCGCAGACG GGCGCATTGT GTTTGGCGTC AATAACGGAT CAGAAGGAAC AACCCTCTGT
GGCCAAACGA ACGTAACCGA CGGTCGCTGG CATCATATTG CGCTGACACG TTCCGCCTCC
AATGGCAGCC TGGCAATCTT CATCGATGGA CGCCTCGACG CCCGTGGGGA AGGACCGACC
GGCGACGTCA GTTACCGCGA CGGGCGTGCA ACCCAGTACC CCGCCGATCC CTTCCTCGTG
ATCGGCGCTG AAAAACACGA TGCCGGTCCC GAATATCCGT CGTTTCGCGG ATGGCTCGAT
GACGTTCGCA TTTCGCGCAT CATCCGCTAC CGTGGCGCGT TCACCCCGCC AACCGCTCCC
TTCACGCCCG ATGCCGACAC AGTTGCGCTC TATCACTTCA ACGAAGGCGC TGGCGCCACA
GTCCGCGACT CATCGGGCGC ATCGGGTGGT CCCAGTGATG GCGCTCTGCG CGTCGGCGGA
TCACCCGCCG GTCCTGCCTG GTCCGAAGAT ACCCCCTGGA TCAGTTCGAG CGCATCACCG
CCCTCACCGA CATCTCCGCC ATCACAGGGT GCGCCAGCGC TGTCACCGCA ACCATCGCCC
ACCACAATCA TTGCTGTCGT CACGAACGTC CCGCTGCCGA CAGACACACC CGTTCCACCA
ACACAACCAG CGAGTGCAGC GTCTCCCCCG ACGTCCACAT CCCTTCCCTC TCCAACACTA
CCGACCGCCG GAATCGTTGC ACCCTCGCCA ACGCCTGGCG CTGCGCCCGC GCCTCCCAAT
CCACCGTACT GGCTGCTCAT CATTGCGGTC GCCGGACTGG CGCTGGCTGG CGTCGGGGTG
GCGCTCATGC GCCGGAGGTA A
 
Protein sequence
MGRFPTFPFN ERFQRAGRAL VALALLLTVL PESIRAQGDF SLRFYGTGRD GVDRVMIPLD 
APPRPVDVGG DFTIEFWLKA LPGDNAASAC SPGEDNWIYG NVVIDRDVYF AGDYGDYGIS
LADGRIVFGV NNGSEGTTLC GQTNVTDGRW HHIALTRSAS NGSLAIFIDG RLDARGEGPT
GDVSYRDGRA TQYPADPFLV IGAEKHDAGP EYPSFRGWLD DVRISRIIRY RGAFTPPTAP
FTPDADTVAL YHFNEGAGAT VRDSSGASGG PSDGALRVGG SPAGPAWSED TPWISSSASP
PSPTSPPSQG APALSPQPSP TTIIAVVTNV PLPTDTPVPP TQPASAASPP TSTSLPSPTL
PTAGIVAPSP TPGAAPAPPN PPYWLLIIAV AGLALAGVGV ALMRRR
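
The relationship between the two sequences can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code (the tables differ in which codons may act as start codons). To keep the example short, only the first 30 and last 21 bases of the gene sequence are used; the `translate` helper is hypothetical and handles only unambiguous A/C/G/T input.

```python
from itertools import product

# Codon assignments in T, C, A, G order (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Excerpts copied from the gene sequence above (first 30 and last 21 bases).
GENE_START = "ATGGGTCGCT TTCCCACATT CCCGTTCAAC"
GENE_END = "GCGCTCATGC GCCGGAGGTA A"

print(translate(GENE_START))  # MGRFPTFPFN
print(translate(GENE_END))    # ALMRRR
```

The two outputs match the first ten and last six residues of the protein sequence. The final codon, TAA, is a stop codon, which is why the 1221 bp gene encodes 406 residues rather than 407.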