Gene Rcas_0021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0021 
Symbol 
ID5537478 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp25868 
End bp27007 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content62% 
IMG OID640892186 
Producthypothetical protein 
Protein accessionYP_001430178 
Protein GI156740049 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000314696 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATCACAC CCACCGACAC GCCACCCATC GCCGCACCGC ACCTCCCCAC GATAACGTCT 
GCTGTCTTGA CGATTTACCA GTTTTTCGAC ATCGGCGACG CCATCGACCT GGATCAGGCG
CAACGCTGCC TGAGCAACCC GTCGGAGCGG CGTGTTCGCC TGCGCACGCG CCAATCCGAA
AGTATTCGCA TCGCGCAGCC GCCGTTGCGC ATCGATCTTG GAAGTGTGCC GGTCACACTG
GCGGATGAGC AGCGCATCGG CGCGTTGCGC GCCGTCGTGT ACGATCTCGG CGCCGTCGAG
ATTGCCATTG AGATACGCCT GACCACGCCG CTATCGTGGG AGAACGTCGC CGATCTGTTT
GCTGCTGCGC AGGAACTGCC AGCCGAGATG ACAGAACGTA TTGCCGCTGT GCTCGATGAC
CTGGAAGCGC TCATTCGACC GGCGATTTCC CGTCCGCAAC GCTCAACAGT GGTCGAAGAC
TACTGTGTGC TGATCGTCGA ACGCCTCGCC CCGCCCTGCA ATGTCGCTGA ACTGTCGGAT
CATCCGGCGG TGCGCGCCGC GCTCCTGGGT GAGCGACGCA CCCTCAGCGC GGATGCGTCG
CGCCTGGTCA CCGCGCTGAG TTATTACTCG GATGATCTGG CGCTCCTCAG CTGGAACGGC
GCACTCCTGA TTGAATCGGA TGCCGCCGCA GCCGCGACGG CTGTCGATAT TCTGGCATTT
GCCAATGTCG AACTGCTGCT CATTCGATCC TACGACGCCG CGCTCGATGC GCGCCTGCCG
GAAGTTCATC GGCGGATTGC CCAGGCGCAG CGGCGTTTCA CAATGCCCAT CGTGCGCCGC
TACAGTCAGT TGCTCAGCGA TGTGCAGCGG CTGGTCGCAG AAGTGACCGA AGTGACCGAA
CAGATCGACA ACGCGCTCAA GGTCACCGAC GATGTGTACT GGAACCGACT CTACAGCGCG
GCGCTCAGTG TATTGCGCGT GCGTGTCTGG CGCGATGGCG TCGATCACAA ACTGGCGCTG
CTGCGCGAAA CGTATGCCAT GCTCCATGCC GATGCCGATT CGGAACGCGC CGCCGCTCTC
GAATGGGCCA TCGTGCTCCT GATCGTCTTC GAGATCGTAA TGGCGCTGCT GGGGAAATGA
 
Protein sequence
MITPTDTPPI AAPHLPTITS AVLTIYQFFD IGDAIDLDQA QRCLSNPSER RVRLRTRQSE 
SIRIAQPPLR IDLGSVPVTL ADEQRIGALR AVVYDLGAVE IAIEIRLTTP LSWENVADLF
AAAQELPAEM TERIAAVLDD LEALIRPAIS RPQRSTVVED YCVLIVERLA PPCNVAELSD
HPAVRAALLG ERRTLSADAS RLVTALSYYS DDLALLSWNG ALLIESDAAA AATAVDILAF
ANVELLLIRS YDAALDARLP EVHRRIAQAQ RRFTMPIVRR YSQLLSDVQR LVAEVTEVTE
QIDNALKVTD DVYWNRLYSA ALSVLRVRVW RDGVDHKLAL LRETYAMLHA DADSERAAAL
EWAIVLLIVF EIVMALLGK