Gene Rcas_1631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1631 
Symbol 
ID5539107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2107875 
End bp2108945 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content60% 
IMG OID640893768 
Productbasic membrane lipoprotein 
Protein accessionYP_001431741 
Protein GI156741612 
COG category[R] General function prediction only 
COG ID[COG1744] Uncharacterized ABC-type transport system, periplasmic component/surface lipoprotein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0120157 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGGC CGGTTTCTCT CATCATCCTG CTGCTCGTCT CGGTGTTGAT TGCCGCATGT 
GGCGGGCAGC AGACAGTCAC CCCAACGACC GCCCCCACCG GCGCAACTGC CGCGCCGGCC
AGACCCACTG AAGCAGATTG CCCCAAACCC GAAGTTCTGT GTGTCGGTCT CGTGACCGAC
GTGGGCGAGA TCGACGACAA GAGCTTCAAC CAGTCCGCCT GGGAAGGGGT CAAGCGCGCC
GAAGCCGAAT TGGGCGCGAT CGTCAACTAT GTCGAGACAA AAGACGCCAA GGATTATGAT
GCGAACATCG CGTTGTTCAC CGAGAAGGGG TACGATGTGA TCGTTACCGT TGGCTTTGCG
CTCGGCGAAG CGACGGCGAA GGCTGCGGCG GAGAATCCGA ATGTGAAGTT CATTGGCGTC
GATCAGTTCC AGGCAACGGA AATTCCGAAT GTTGCGGGCC TGATCTTCGC TGAGGACAAG
GCGGGTTTCC TGGCGGGCGT GCTTGCAGCG GAAATGTCGA AGAGCAACAA GATCGCCGCT
GTGCTCGGCA CCAACCTGGT TCCGCCGGTT GTCGCCTTCA AGGAAGGGTA CGAGAATGGC
GCGAAGTATG TCAAGCCCGA TATCCAGGTG ATCTCGACCT ACCATCCGGG TGGTCTCGAT
ACGGCGTTCA CCGACCCGAA GTGGGGCGCC GACCAGGCAA AACTGGCGAT TGACCAGGGC
GCCGATGTGA TCTTCGGCGC TGGCGGCAAG ACTGGCAACG GCGCGCTGAT CGAGACGGCA
GCCAACCAGG GGGTGTACTG CATCGGTGTC GATACCGATC AGTGGGAGAC GGTGCCGGAG
GCTCGCCCGT GCCTGATCTC GAGCGCAATG AAACTGATTA CGCCGGGTGT GTTCGACCTG
ATCAAGAAGG CGCAGGAAGG CGCGTTCCCT TCTGGCAATT ATGTCGGTGA AGTCGGGCTG
GCGCCGTTCC ACGACTTCGA TGCCCAGATC CCGGCTGAGG TGAAAGCGAA GATCGCCGAG
ATCGACAAAG GCCTGCGCGA TGGTTCAATT TCGACGGGAT ATACTCCGTA G
 
Protein sequence
MKRPVSLIIL LLVSVLIAAC GGQQTVTPTT APTGATAAPA RPTEADCPKP EVLCVGLVTD 
VGEIDDKSFN QSAWEGVKRA EAELGAIVNY VETKDAKDYD ANIALFTEKG YDVIVTVGFA
LGEATAKAAA ENPNVKFIGV DQFQATEIPN VAGLIFAEDK AGFLAGVLAA EMSKSNKIAA
VLGTNLVPPV VAFKEGYENG AKYVKPDIQV ISTYHPGGLD TAFTDPKWGA DQAKLAIDQG
ADVIFGAGGK TGNGALIETA ANQGVYCIGV DTDQWETVPE ARPCLISSAM KLITPGVFDL
IKKAQEGAFP SGNYVGEVGL APFHDFDAQI PAEVKAKIAE IDKGLRDGSI STGYTP