Gene RoseRS_3080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3080 
Symbol 
ID5210048 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3870694 
End bp3871734 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content65% 
IMG OID640596671 
Productthreonine synthase 
Protein accessionYP_001277393 
Protein GI148657188 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0498] Threonine synthase 
TIGRFAM ID[TIGR00260] threonine synthase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0615422 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00256944 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTGTTCG AACGCTATGG CGCATTTCTG CCATTGACCG GGCAAACGCC GCGCCTCAGC 
CTTGGCGAAG GCGATACGCC GCTGATTGCC GCACCCCGCC TGGCGCGCTC CATCGGCGTG
CGTGAGTTGT ACCTGAAGTA CGAAGGCGCC AACCCGACCG GATCGTTCAA AGATCGCGGA
ATGGTGGTGG CGGTCGCCAA AGCCATCGAA GCCGGCGCTA CCTCGGTCAT CTGCGCTTCG
ACCGGCAATA CCTCGGCGAG CGCGGCGGCA TATGCGGCGC ATGCCGGCAT CGAGTCAATT
GTGGTTGTGC CTGCCGGAAA GATCGCCCTG GGCAAACTGG CGCAGGCGCT GATGTATGGC
GCGCGGTTGC TGGTGATCGA GGGCAACTTC GACGAAGCGC TGCGGATTGT GCGCGATCTG
GCGCGGCAGT TTCCGGTGAC GCTGGTCAAC TCCGTCAATC CGCACCGCCT CGAAGGGCAG
GCGACGGCAG CCTACGAGAT CTGTGATACG CTGGGTGGTC CGCCCGATGC GCTCTGTCTG
CCGGTCGGCA ATGCGGGGAA TATCACCGCG TACTGGATGG GATTCCGCCG GTATTACGAA
GCAGGCAGGA TCAACCGCCT GCCGAAGATG CTCGGCTTTC AGGCGGAGGG CGCAGCGCCG
ATTGTGCACG GCGCTCCGGT GGAACATCCT GAGACGGTTG CGACCGCGAT CCGGATCGGC
AACCCGGCGA GCTGGTGTTA TGCGCTCGAT GCGCGCGATC AGTCGGGAGG ATCGATCGAC
GCCGTCAGCG ATGAGCAGAT CCTGCGGAGC TGGCGCGACC TGGCGCGCCT GGAAGGGGTA
TTCGCGGAGC CGGCATCGGC AGCCGGCGTC GCCGGGTTGC GCAAAATGGT CGCCGAAGGG
CGCGCCGATC CGGATGCATG CTATGTGGCG GTGCTGACCG GTCATGGACT GAAAGATCCC
GGACTGGCGG TGGAGCAATT CGAGACGCCT CAGCCGGTGC CGGCGGATAT GAATGCCATT
CTCCGATGGT TGGGCTGGTG A
 
Protein sequence
MLFERYGAFL PLTGQTPRLS LGEGDTPLIA APRLARSIGV RELYLKYEGA NPTGSFKDRG 
MVVAVAKAIE AGATSVICAS TGNTSASAAA YAAHAGIESI VVVPAGKIAL GKLAQALMYG
ARLLVIEGNF DEALRIVRDL ARQFPVTLVN SVNPHRLEGQ ATAAYEICDT LGGPPDALCL
PVGNAGNITA YWMGFRRYYE AGRINRLPKM LGFQAEGAAP IVHGAPVEHP ETVATAIRIG
NPASWCYALD ARDQSGGSID AVSDEQILRS WRDLARLEGV FAEPASAAGV AGLRKMVAEG
RADPDACYVA VLTGHGLKDP GLAVEQFETP QPVPADMNAI LRWLGW