Gene Rcas_1220 details

Gene Information

Locus tag: Rcas_1220
Symbol:
ID: 5538687
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 1577621
End bp: 1578862
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 61%
IMG OID: 640893353
Product: ErfK/YbiS/YcfS/YnhG family protein
Protein accession: YP_001431335
Protein GI: 156741206
COG category: [S] Function unknown
COG ID: [COG1376] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
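
The coordinates and lengths in this record are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of codons minus one stop codon, for an unspliced CDS."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1577621, 1578862)
print(length, protein_length_aa(length))  # prints "1242 413"
```

Both values match the Gene Length and Protein Length fields above.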


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.68044
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTTCTC GCCCCTGGTA TGCACCGCTC ACAATGATCG GCATCGTGGC GCTGACGACG 
CTGGCGCTTG TGGCGCCTGC CGAACTGCGC GCGCGGCAGC GCCCCACAGT CGCCGATTTT
GCGCCGCCAA TGACGCCATC CGAAGCGCAG GCGATCGTCG CCGCCGCGCG CGCTGCCGAA
CGCGCGCAAC GCATCGAACA GATGCAGCAA CAGAGCGGGC AGGATGCGGT TCCACTGCCG
GCCGTGCAAA ACATGTATTT CGCGTCCTCC GGCTTTCACA TCAGCGACCG CACCGGATTC
CTGAGCTTCT GGCGCAGGAA CGGCGGCGAT CTGATTTTCG GGTATCCGAT CAGCGGCGAA
ATGGTCGAAG ATGGGCGGAT CGTCCAGTAT TTCGAGCGCG CCCGCTTTGA GTACCATCCC
GAACATCTGG GAACTGACTA CCAGGTGATG TTGTCGCTCC TTGGCAACGA ACTGACCCAG
GGGTACGATT TTCCCGATGG ACAGCCGACG CAGGGGCGGA TCTACTTTCC CGAAACACGT
CAGACGCTTG GCGGCAAGTT TCTGAAGTTC TGGCAGAAGC GCGGCGGGTT GCGCATCTTT
GGCTACCCGA TCAGCGAACC GTTCGAGGAA ATCAGCCCGA TTGACGGACA GGTGCGCATC
ACGCAGTATT TCGAGCGCGC GCGCTTCGAG TACCACCCAG AGAAACTTCC GGCGTTCTAT
CGCCAGATGG AACGGGCAAA CGGGATTATG CTCGCCGGAC TCTACGAAGT CCAGTTGACC
GATCTGGGAC GGCAGGCGAT GCAACGCCGG GGACACACGC CGCAGTCAAC CGGTCCGATG
CCCGGCGCGC CGGTCTGGTC ATCTGCATTA TTCGAGCGAC GCATTGAGGT AAATCTCTCA
ACGCAGATGC TGACGGCATT CGAGGGGGAG GCGCCGGTCT ATCGCGCCCC GGTTGCAACC
GGACGCGACG GTTTCAATAC GCCGGTTGGA ACATTCGCCG TGTACTCCAA ACTGCCGATA
CAAACGATGA CCGGCTCGGC GGGCGGCGAG TCGTGGTATG TGCCCGATAT TCCATGGGTG
CAGTATGTGG TTGGCGGCGT GGCGCTCCAC GGCACCTACT GGCACGACGC CTGGGGCACA
GGGGTGCGTA TGTCGCACGG GTGTATCAAC CTGAATATCG ATGACGCGGA ATGGCTGTAT
CGCTGGACGG ACATTGGAAC CCGGGTGGAT ATTATTGACT GA
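
The sequence is displayed in blocks of 10 bases, so the whitespace has to be stripped before any computation. As a minimal sketch, the helpers below (hypothetical names, not part of the source database) clean a pasted sequence and compute its GC percentage, which for the full gene can be compared against the GC content field of the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, as displayed above.
first_blocks = clean_sequence("ATGACTTCTC GCCCCTGGTA")
print(len(first_blocks), gc_percent(first_blocks))
```

Passing the whole gene sequence to `clean_sequence` and then `gc_percent` gives the value for the complete CDS.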
 
Protein sequence
MTSRPWYAPL TMIGIVALTT LALVAPAELR ARQRPTVADF APPMTPSEAQ AIVAAARAAE 
RAQRIEQMQQ QSGQDAVPLP AVQNMYFASS GFHISDRTGF LSFWRRNGGD LIFGYPISGE
MVEDGRIVQY FERARFEYHP EHLGTDYQVM LSLLGNELTQ GYDFPDGQPT QGRIYFPETR
QTLGGKFLKF WQKRGGLRIF GYPISEPFEE ISPIDGQVRI TQYFERARFE YHPEKLPAFY
RQMERANGIM LAGLYEVQLT DLGRQAMQRR GHTPQSTGPM PGAPVWSSAL FERRIEVNLS
TQMLTAFEGE APVYRAPVAT GRDGFNTPVG TFAVYSKLPI QTMTGSAGGE SWYVPDIPWV
QYVVGGVALH GTYWHDAWGT GVRMSHGCIN LNIDDAEWLY RWTDIGTRVD IID
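
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is an illustration, not the database's own pipeline: it builds the standard codon table, whose amino acid assignments translation table 11 shares (table 11 differs in the alternative start codons it permits, which does not matter here because the gene starts with ATG). The function name `translate` is hypothetical.

```python
from itertools import product

# Codons in TCAG order, paired with the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGACTTCTC GCCCCTGGTA TGCACCGCTC"))  # prints "MTSRPWYAPL"
```

The result matches the first block of the protein sequence; the final codons of the gene, ATT ATT GAC TGA, translate to IID followed by a stop.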