Gene Rcas_2458 details

Gene Information

Locus tag: Rcas_2458
Symbol:
ID: 5539939
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 3158506
End bp: 3159672
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 63%
IMG OID: 640894588
Product: hypothetical protein
Protein accession: YP_001432556
Protein GI: 156742427
COG category:
COG ID:
TIGRFAM ID:
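
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function name `cds_lengths` is illustrative and not part of the source database.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the CDS ends
    with exactly one stop codon that is excluded from the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len, gene_len // 3 - 1


# Start bp and End bp as listed for Rcas_2458.
gene_len, protein_len = cds_lengths(3158506, 3159672)
print(gene_len, protein_len)  # 1167 388
```

The result matches the listed Gene Length (1167 bp) and Protein Length (388 aa).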


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.21472
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCATCCA CGGTGATGCT TCCTCAACCT GACCGCTCGT TACGCGCCTA CACCTTTCGT 
CCTCCCGCTG CGTTTGCGCC GCCAACTGCA CCGCTGCGCA GCCGTGTCTT CTTTGCTGCC
GCGCATATCG TCGCCGACCC GCTCGCCGAT GTGACTCCCG CATCGCCTCC GGCGCTCGAT
TGGGAGGCGA CGCTGGCGTA CCGCCGTTAT CTCTGGTCGC TGGGTCTGGG TGTCGCTGAA
GCCATGGACA CGGCGCAGCG CGGTATGGGG CTCGACTGGC TCACGGCGCG TGATCTGATC
ATGCGTTCTG TGGCGGAGGC GCGCGCCGTC GGCGGTGTGA TCGCCTGCGG CGCAGGGACC
GATCACCTGC CGCCCTCCCC CAACCTCACG CTCGATCAGG TTGAATCGGC GTATGCTGAA
CAGGTTGAAG CGGTCGAACA TGCAGGCGGG CGCGTGATCC TGATGGCAAG CCGCGCGCTT
GCCGCATGCG CGCGCGGTCC CGACGATTAT GCCCGTGTCT ATGGGCGGAT TCTGTCGCAG
GTGCGTGAGC CGGTCATCAT TCACTGGCTC GGCGATATGT TCGATCCGCA CCTCGCCGGG
TATTGGGGCA GCCGCAACCT CGATGACGCG ATGATGACGG CGCTTGCCAT CATTCACGAC
CATGCGGCGA AGATCGACGG CATTAAGATC TCCCTGCTCG ATGCACACCG TGAGGTTCAA
ATGCGCCGAC GGTTGCCGCC GGGGGTTCGT ATGTACTCCG GCGATGATTT CAACTACCCC
GATCTGATCC TGGGCGACAA TCAGGGATAT AGCGATGCGT TGCTGGGTAT CTTCGACGCA
ATTGCTCCGG CGGCATCGGC AGCGTTGCAG GCGCTCGATG CCGATGATCC GTCGCGCTTT
CAGGCGATCC TCGAACCGAC CGTGCCGCTC TCGCGGCACA TCTTTCAGGC GCCGACCTAC
TACTACAAAA CCGGCGTTGT CTTCCTCGCC TATCTCAACG GACATCAGAA CCATTTCCGC
ATGGTCGGCG GTCAGGAAAG CGCGCGCTCC ATCGTCCATC TGGCGCAGTT GCTGGTCCTG
GCGGATCAAG CGGGCGTGCT GCGCGACCCG GACCTCGCTG CTGCGCGAAT GCGCCACGTG
CTGGCGCTGG CAGGAATTGA GGGGTGA
 
Protein sequence
MPSTVMLPQP DRSLRAYTFR PPAAFAPPTA PLRSRVFFAA AHIVADPLAD VTPASPPALD 
WEATLAYRRY LWSLGLGVAE AMDTAQRGMG LDWLTARDLI MRSVAEARAV GGVIACGAGT
DHLPPSPNLT LDQVESAYAE QVEAVEHAGG RVILMASRAL AACARGPDDY ARVYGRILSQ
VREPVIIHWL GDMFDPHLAG YWGSRNLDDA MMTALAIIHD HAAKIDGIKI SLLDAHREVQ
MRRRLPPGVR MYSGDDFNYP DLILGDNQGY SDALLGIFDA IAPAASAALQ ALDADDPSRF
QAILEPTVPL SRHIFQAPTY YYKTGVVFLA YLNGHQNHFR MVGGQESARS IVHLAQLLVL
ADQAGVLRDP DLAAARMRHV LALAGIEG
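
The correspondence between the gene sequence and the protein sequence can be spot-checked by translating the first and last blocks of the gene. This is a minimal sketch rather than the database's own method: translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in the permitted start codons), so the standard table is built here in NCBI's TCAG ordering. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI's TCAG ordering; table 11 shares them.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used on this page) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence, copied from above.
print(translate("ATGCCATCCA CGGTGATGCT TCCTCAACCT"))  # MPSTVMLPQP
print(translate("CTGGCGCTGG CAGGAATTGA GGGGTGA"))     # LALAGIEG
```

Both results agree with the start (MPSTVMLPQP) and end (LALAGIEG) of the listed protein sequence, and the final TGA codon is read as the stop.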