Gene RoseRS_3331 details

Gene Information

Locus tag: RoseRS_3331
Symbol:
ID: 5210308
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 4178384
End bp: 4179589
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 59%
IMG OID: 640596929
Product: cysteinyl-tRNA synthetase
Protein accession: YP_001277642
Protein GI: 148657437
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0215] Cysteinyl-tRNA synthetase
TIGRFAM ID: [TIGR00435] cysteinyl-tRNA synthetase
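
The coordinates and lengths listed above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
# Values copied from the record above.
START_BP = 4178384
END_BP = 4179589


def feature_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = feature_length(START_BP, END_BP)
print(gene_len, protein_length(gene_len))  # prints: 1206 401
```

Both results match the Gene Length and Protein Length fields of the record.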


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.552746
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.24215
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGGTTAT TCGACACCCT GCGCGGACAG AAAGCCGAGT TGCTCATCCC GCGCGACCGC 
CCGCTCACGC TCTATGTCTG CGGCGTCACT CCCTACGATA CCACGCACGT TGGTCACGCC
CATACGTTTC TGATCTTCGA TGTGCTCATC CGCTATATTC GTCACTGCGG CGGTACGGTT
CGCTACTGTC AGAATACCAC CGACGTCGAT GATCCATTGT TCGAGCGCGC CGCACGTGAT
GGCATCCCGT GGGACGAACT GGCGCGTCGT GAAACGGAAC AGTTCGTCAA AGACTGTCGC
GCGCTCAATC TGATCCCTCC CGATTTCTTT CCAAAGGCTT CGGAAGAGAT TGCCGCGATG
ATCCCGATCA TCGAACGGTT GATCGAATTG GGGCATGCCT ACGTGCGAAA TGGCAATGTC
TATTACGACG TCTCAACCGA ACCGACCTAT GGCGCAATGG CGCGGGTGAG CGGGTATGAG
GAACTGCTCG CGCTGGCGAA CGAACGCGGC AACAATCCGA ACGACCCGCT CAAAGACGAT
CCGCTCGATT TCGTGCTCTG GCAGCGCAGT CGTCCCGGCG AACCGACCTG GCCCAGCCCG
TGGGGAAGGG GACGCCCCGG ATGGCATATC GAATGCACGG CAATGGCAAC CCGCTACCTC
GGTCCGCAAC TCGATATCCA TGGCGGCGGG CGTGATCTGA TCTTCCCGCA CCATCCTTCC
GAAATTGTGC AGACCGAACC GTATACCGGC AAACGCCCCT TTGTTCACTT CTGGGTTCAC
GGAGGGCTGG CATGGCTTGA TGGTCAGAAG ATGAGCAAGT CGCTCGGAAA TCTGGTGTTT
ATCAAGGATG CGCTCAGGCA GCACAGCGCC GATGCGCTCC GCTGGTACCT GCTTTCGTTC
CCCTACCGCG ACGATTTTGA GTATGTGCGC TCCGACGTAC CGCAGGCGGA ACAGAAGGTT
GGACAACTCA AAGCGGCGCT GGCAGCGCAG GGCGATCCCA GAGGCGAGCG GCTGAACCCC
GAACCGTTCC GCCAGGCATA CTTCGCCGCG CTCGATGATG ATCTCGATAC GCCGAAGGCG
CTGGCGCAGA TCAGCGTTCT GAGCGGCGCC ATCCTTGAAG CGGCTTCATC AGGATATGAT
GTGAGTGATG CCCAATCCGC GCTCCGTGAT ATGGCGAACG TTTTCGGTTT CTGGGCGGCG
GCGTGA
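
The gene sequence above is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation. The following is a minimal sketch of how a GC fraction could be computed from such a listing. It assumes the GC content field of the record refers to the gene sequence alone, which the source does not state. The helper names `clean_sequence` and `gc_fraction` are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked listing and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGTGGTTAT TCGACACCCT GCGCGGACAG AAAGCCGAGT TGCTCATCCC GCGCGACCGC"
print(len(clean_sequence(first_line)), gc_fraction(first_line))
```

Passing the full listing to `gc_fraction` gives a value that can be compared with the GC content field.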
 
Protein sequence
MWLFDTLRGQ KAELLIPRDR PLTLYVCGVT PYDTTHVGHA HTFLIFDVLI RYIRHCGGTV 
RYCQNTTDVD DPLFERAARD GIPWDELARR ETEQFVKDCR ALNLIPPDFF PKASEEIAAM
IPIIERLIEL GHAYVRNGNV YYDVSTEPTY GAMARVSGYE ELLALANERG NNPNDPLKDD
PLDFVLWQRS RPGEPTWPSP WGRGRPGWHI ECTAMATRYL GPQLDIHGGG RDLIFPHHPS
EIVQTEPYTG KRPFVHFWVH GGLAWLDGQK MSKSLGNLVF IKDALRQHSA DALRWYLLSF
PYRDDFEYVR SDVPQAEQKV GQLKAALAAQ GDPRGERLNP EPFRQAYFAA LDDDLDTPKA
LAQISVLSGA ILEAASSGYD VSDAQSALRD MANVFGFWAA A
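
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds a codon table with the standard amino-acid assignments, which translation table 11 shares with table 1 (table 11 differs in the alternative start codons it allows, which does not matter here because this gene starts with ATG). The `translate` function is illustrative, not the database's own tool, and it raises a `KeyError` on ambiguous bases such as N.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(raw_dna: str) -> str:
    """Translate a DNA listing codon by codon, stopping at the first stop codon."""
    seq = "".join(raw_dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record above.
print(translate("ATGTGGTTAT TCGACACCCT GCGCGGACAG"))  # prints: MWLFDTLRGQ
```

The output matches the first ten residues of the protein sequence, and the final six bases of the gene, GCGTGA, translate to an alanine followed by a stop codon, in agreement with the terminal A of the protein.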