Gene Rcas_4058 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4058 
Symbol 
ID5541569 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5266803 
End bp5267873 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content61% 
IMG OID640896170 
Productputative virion core protein (lumpy skin disease virus)-like protein 
Protein accessionYP_001434108 
Protein GI156743979 
COG category[S] Function unknown 
COG ID[COG4260] Putative virion core protein (lumpy skin disease virus) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.13522 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0384767 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACTGC TCGACATTGT TGAATTCGTC GATCCTACGG GCAAAACGCT GGTTGCGCGC 
GTTCCGCCGG ACGGCAATGG CGAATTGCGT CTCGGTTCTC AGTGCATTGT GCGCGAAGGG
CAGCTCGCTT TCTTTGCGCG TGACGGGCGT TTCCTCGATA TGCTCATTCC GGGGCGGCAC
ACGTTGACGA GCAACAACAT TCCGCTGCTG ATCGACTTCA TCAAACTGCC GTTCGGCAGC
AAAAGCCCAT TCCGCGCCGA TGTCTACTTC GTCAGCCTGC ACCAGCACAC CGATCTGAAA
TGGGGAACGC CGCAACCAAT CCCGATGCGC GATGCGCAGT TCGGCATGGT GCGGTTGCGG
GCGTTTGGCA CATTTATCAT CCAGGTAGCC GAACCCCGCC GCTTGCTCAC TGCCGTGGTC
GGCACCCGCG GTCGCCTGAC AGTCCAGGAT GTCGAGGAGC AACTGCGCAG TTCGATTATT
GCGCGCGTTG CTGATGTCAT TGCCGAGCGC ATGCGTGAGC GTCAACTCTC GGTGCTCGAC
CTTGCGACCG AGTATGATGA ACTCTCGGAA ATGGCGCACG AAGTGTTGAA GGACGACTTT
GCCGCGCTTG GCTTGCAGTT GACGCGCTTC TACATCAACA CCATCAGCGT GCCCGAAGAA
CTCGAGCGGC GGCTCGATCA GGTCGGCGGC GTGGCAGCGT TTGGCGGATT GGGCGACTAC
ACGCGCTTCA AGGCGGCTGA AGCGCTACAC GATGCCGCGC GCACCGGAGG CGACAGCACC
GTCGGCGCAG GCATCGGGCT GGGTGCGGGA ATGAACCTTG GGGCGCTCAT GGGTCAGGTT
CTTCAGCAGC AGACGCCGAC ACAATCGCCG CCACAGCCAG CGCCGATAAC GGCGACATCC
TCACAGGCGG CAACGAAGAC ATGCCCGCGC TGTAACACGG CTATGCCTGC GAACGCCAGG
TTTTGCAGCG AGTGCGGCGC GTCGCTCCTA CCGGCGACAT GCCCGCAATG TGGACATGCA
GTGACGACTG GGGCGAAGTT CTGCATCGAA TGCGGTGCGG CGCTGAAATA A
 
Protein sequence
MPLLDIVEFV DPTGKTLVAR VPPDGNGELR LGSQCIVREG QLAFFARDGR FLDMLIPGRH 
TLTSNNIPLL IDFIKLPFGS KSPFRADVYF VSLHQHTDLK WGTPQPIPMR DAQFGMVRLR
AFGTFIIQVA EPRRLLTAVV GTRGRLTVQD VEEQLRSSII ARVADVIAER MRERQLSVLD
LATEYDELSE MAHEVLKDDF AALGLQLTRF YINTISVPEE LERRLDQVGG VAAFGGLGDY
TRFKAAEALH DAARTGGDST VGAGIGLGAG MNLGALMGQV LQQQTPTQSP PQPAPITATS
SQAATKTCPR CNTAMPANAR FCSECGASLL PATCPQCGHA VTTGAKFCIE CGAALK