Gene Rcas_3664 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3664 
Symbol 
ID5541166 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4792077 
End bp4793141 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content60% 
IMG OID640895784 
Producthypothetical protein 
Protein accessionYP_001433731 
Protein GI156743602 
COG category[C] Energy production and conversion 
COG ID[COG1592] Rubrerythrin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.60307 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATGCG GTACAATGCA TGCCAGATGG TTCCGATTGA ACAAAGAAGC GTATTGCCGT 
ATGACCGATT CGGATCGTGA AGCGCGCCAG ATGTTCGCTC TCGGGAGCAT GGGACATGTC
GCATACCATT TATGGGCGGA ACAGGCGCGG CGAGAGCATC GTTTCAATCT TGCCCGCTTG
TTCGATGCGT TGAGCGCTGC GCGACTGGCG CGCGCCGGTC AGGCGTTTCG TCGTTTGAAT
TGGGTGCGCT CGACGGCTGA AAATGTCGTC AGCGCCTTTT CTGGCGCAGT CATTGGTGAT
ATCGACGCTG ATCGGATCAC CGGCGTGACG CCGCTTGCGC GGGAGTTGCT GGCGCGGGCG
CAGCGCGCCG TAAGTGAGGG ACGCGATCTG CGCGCCGGGG AGCTTGGCGA TCTGTTCGTC
TGTACGACGT GTGGCGAGAT CTGCGAAGGT AAACTCGAAG GCGCCTGCCG ACGCTGTGGC
ACCGTTCCCG AAGCGCATCG GGCATTTCGG GCCATTGAAG CGATGGGTAC GCTTGGTCCG
CATGCGATTA TGGCATTTCT GGAACGGACG GAAGAGGCGC TGCGCACCCT TGTGGCGGGT
CTCGACGACG ATTTCCTCGC GCGTCGCCTG AACGACGCCA CGCCATCGCT TAAGGAGTTG
ATCGGTCATC TTGCCGATAT GGACGCGATC TTTCGTCAGC GCGCCTGGTT GCTCCTCGAA
ACCAATCAGC CGACGCTTTC ACCCGCACAC CCGCCATCGC TCGAATCTGC GGCAATGTAC
CGCGACCAAC CGATAGATGC TGTGCTCGAT GCCTATCATA CGACACGCGC GCAGACGTTG
AGTCTGTTGC GTGGGTTGAC CAGCGCCGCC TGGCATCGCG AAGGGTATCA CGAGGTGTAT
GGGGTGATCA ATCTGTTGCA CCAGGCGAAC TGGCTCATTT CGCACGAACG AGCGCATCTC
GTCGAAATGG CGCAGATCCG TCACGACCTG ATCGCAACTG ATCGGCGCTA CGCTGAGACG
ACGGTTGCGG ACATTGTTGT GACCGCTTCG AACGAAGGCG AGTGA
 
Protein sequence
MRCGTMHARW FRLNKEAYCR MTDSDREARQ MFALGSMGHV AYHLWAEQAR REHRFNLARL 
FDALSAARLA RAGQAFRRLN WVRSTAENVV SAFSGAVIGD IDADRITGVT PLARELLARA
QRAVSEGRDL RAGELGDLFV CTTCGEICEG KLEGACRRCG TVPEAHRAFR AIEAMGTLGP
HAIMAFLERT EEALRTLVAG LDDDFLARRL NDATPSLKEL IGHLADMDAI FRQRAWLLLE
TNQPTLSPAH PPSLESAAMY RDQPIDAVLD AYHTTRAQTL SLLRGLTSAA WHREGYHEVY
GVINLLHQAN WLISHERAHL VEMAQIRHDL IATDRRYAET TVADIVVTAS NEGE