Gene Rcas_3186 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3186 
Symbol 
ID5540684 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4142639 
End bp4143679 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content64% 
IMG OID640895307 
Productthreonine synthase 
Protein accessionYP_001433258 
Protein GI156743129 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0498] Threonine synthase 
TIGRFAM ID[TIGR00260] threonine synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000625844 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.196815 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGTTCG ACCGCTATGG CGCATTTCTG CCGCTCACGG AGCAAACGCC GCGCCTCAGC 
CTTGGCGAAG GCGATACGCC ATTGATCCAC GCGCCGCGCC TGGCGCGCGC CATTGGGGTA
CGCGAGTTGT TTCTGAAATA TGAGGGCGCC AACCCGACCG GCTCGTTCAA GGATCGCGGC
ATGGTCGTAG CCGTCGCCAA AGCCCTCGAA GCGGGCGCAA CCTCGGTGAT CTGCGCTTCG
ACCGGCAACA CCTCCGCCAG TGCGGCGGCG TATGCGGCGC ATGCCGGGAT TGAGTCGATC
GTCGTGGTGC CTGCCGGAAA AATTGCGCTT GGCAAACTGG CGCAGGCGCT GATGTACGGC
GCGCGGCTGC TGGTGATCGA GGGCAACTTC GACCAGGCGT TGCACATAGT GCGCGACCTG
GCGCAGACGT ATCCGGTCAC CATTGTCAAC TCGGTGAACC CCTACCGCCT TGAAGGGCAG
GCAACCGCCG CCTACGAAAT CTGCGATGCA CTCGGCGGTC CGCCAGACGC GCTCTGCCTG
CCGGTCGGCA ACGCCGGGAA CATCACTGCG TACTGGATGG GGTTCCGTCG CTATCACGAG
GCGGGGCGCA TCGACCGATT GCCGAGAATG CTCGGTTTCC AGGCGGAAGG CGCTGCACCG
ATTGTGCGCG GGCATCCGGT CGAACACCCG GAAACCATCG CAACCGCGAT CCGCATCGGC
AACCCGGCCA GTTGGTGCTA CGCACTCGAT GCGCGCGATC AGTCGGGCGG GTTGATCGAC
TGGGTGAGCG ATGATCAGAT TCTCCAAAGC TGGCGTGATC TGGCGCGCCT GGAAGGGGTG
TTCGTCGAAC CGGCATCGGC AGCCGGCATC GCCGGGTTGC GCAGAGTCAT CGCCGAAGGA
CGCGCCGAAC CGAATGCGCG CTATGTGGCG GTGCTCACCG GTCATGGACT GAAAGACCCG
GGGCTGGCGG TTGAACAATT CGATGTTCCT GAGCCGACGC CGGCGGACAT GGACGCCATT
CTTCGATGGT TGGGCTGGTA G
 
Protein sequence
MLFDRYGAFL PLTEQTPRLS LGEGDTPLIH APRLARAIGV RELFLKYEGA NPTGSFKDRG 
MVVAVAKALE AGATSVICAS TGNTSASAAA YAAHAGIESI VVVPAGKIAL GKLAQALMYG
ARLLVIEGNF DQALHIVRDL AQTYPVTIVN SVNPYRLEGQ ATAAYEICDA LGGPPDALCL
PVGNAGNITA YWMGFRRYHE AGRIDRLPRM LGFQAEGAAP IVRGHPVEHP ETIATAIRIG
NPASWCYALD ARDQSGGLID WVSDDQILQS WRDLARLEGV FVEPASAAGI AGLRRVIAEG
RAEPNARYVA VLTGHGLKDP GLAVEQFDVP EPTPADMDAI LRWLGW