Gene RoseRS_3860 details

Gene Information

Locus tag: RoseRS_3860
Symbol:
ID: 5210842
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 4826158
End bp: 4827348
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 64%
IMG OID: 640597455
Product: Formyl-CoA transferase
Protein accession: YP_001278163
Protein GI: 148657958
COG category: [C] Energy production and conversion
COG ID: [COG1804] Predicted acyl-CoA transferases/carnitine dehydratase
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0478269
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCTGC CGCTCGAAGG ATTGCGCGTC CTCGATCTGA GCCGCGCACT TGCGGGTCCT 
TTCTGTTCCA TGATGCTCGG CGACCTTGGC GCCGATGTGA TCAAAGTCGA ACAACCGGGT
ATCGGCGACC ACACACGCGC CTGGGGACCG CCATTCGAAG GCGGTGAGAG CACCTACTTT
CTCAGTGTCA ATCGCAACAA GCGCAGTCTG GCGCTCGACT TTCGCGACGA GCGCGGCGCT
GCAGTTCTTC GCCGCCTGAT CGCCAGCAGT GACGTGCTGC TGGAAAACTT CGTTCCCGGC
ACACTGGACC GGCGCGGGTT CGGTTACGAT GCGTGCCGCG CCATTCGTCC TGATCTGGTG
TACTGCTCGA TCTCCGGTTT CGGTCAGGTC GGACCGGACC GAGAGCGCGC CGCGTATGAT
CAGATTGCGC AGGGGTTGGG CGGCCTGATG AGCCTGATCG GTGAACCCGG AGGACCGCCG
ATGCGGGTTG GGATTGCAAT CACCGATATT ATGGCAGGAA TGTTCGCGGC ATATGCCATT
CTGGCGGCGC TCTACCACCG CGCGCGCACC GGCGAAGGGC AACGGGTGGA TACGTCGCTC
CTGGAAGGGC AACTGGCGAT GCTGACCTAT CAGGCGGGCA ACTATTTCGC CACCGGTCGC
GCACCGGAAC GACCGGGCAA TCAGCATCCA TCGATCGTGC CGTATGGGGT GTATCGCGCC
GCCGATGGCT ATTTTACGCT CGGCGTCGGC ACCGATGATC TGTGGCTGCG CTTCTGTGAT
GCGCTCGATC TTGCCGACCT GCGTGATCAT CCCCGTTTCC GCACGAATGT GGCGCGCCTG
GCGCATCGCG CCGAACTGAA TGCGCTGCTC GAACCGGTGT TTGCGTCGTT GCGCGTTGCC
GACATCGAGC AAAGATTGAA TGCCGCCGGT GTGCCGTGCG GCGCGGTGCA CGACCTGGCG
CAGGTGTTCA CCGACCCGCA GGTGCAGGCG CTGGGGAGCG TCGTGACCAT CGAGCACCCG
ACCGCTGGCG CGATCCGGGT CGTTGCTCCA CCTTACCACT TCTCAGCGAC CCCGCCTGCG
ATCCGTCGTC CGCCGCCGCT GTTGGGGCAG CATACCGACG AGATCCTGGC GGAAATTGGC
TACGAACAGC ACGAGATTGC GACGCTCCGT TCGATCGGCG TGGTCGCATA G
 
Protein sequence
MPLPLEGLRV LDLSRALAGP FCSMMLGDLG ADVIKVEQPG IGDHTRAWGP PFEGGESTYF 
LSVNRNKRSL ALDFRDERGA AVLRRLIASS DVLLENFVPG TLDRRGFGYD ACRAIRPDLV
YCSISGFGQV GPDRERAAYD QIAQGLGGLM SLIGEPGGPP MRVGIAITDI MAGMFAAYAI
LAALYHRART GEGQRVDTSL LEGQLAMLTY QAGNYFATGR APERPGNQHP SIVPYGVYRA
ADGYFTLGVG TDDLWLRFCD ALDLADLRDH PRFRTNVARL AHRAELNALL EPVFASLRVA
DIEQRLNAAG VPCGAVHDLA QVFTDPQVQA LGSVVTIEHP TAGAIRVVAP PYHFSATPPA
IRRPPPLLGQ HTDEILAEIG YEQHEIATLR SIGVVA
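
The listed gene length (1191 bp) and protein length (396 aa) are consistent with each other once the stop codon is counted: 1191 / 3 = 397 codons, of which the final TAG is the stop. A minimal Python sketch of that check follows, assuming the sequence blocks are copied as shown here, in space-separated groups of ten. The helper names `clean`, `gc_fraction`, and `expected_protein_length` are illustrative and are not part of any database tooling. Only the first line of the gene sequence is used as sample input, so the GC fraction it computes is for that line, not for the whole gene.

```python
def clean(seq_block: str) -> str:
    """Strip spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def expected_protein_length(gene_length_bp: int) -> int:
    """Codons in the CDS minus the terminating stop codon."""
    return gene_length_bp // 3 - 1


# First line of the gene sequence from this record (sample input only).
first_line = "ATGCCCCTGC CGCTCGAAGG ATTGCGCGTC CTCGATCTGA GCCGCGCACT TGCGGGTCCT"
dna = clean(first_line)

print(len(dna))                       # 60 bases on a full line
print(expected_protein_length(1191))  # 396, matching the Protein Length field
print(round(gc_fraction(dna), 2))     # GC fraction of this one line only
```

Passing the full gene sequence to `clean` and `gc_fraction` in the same way would allow comparison against the GC content field.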