Gene Rcas_4372 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4372 
Symbol 
ID5541885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5625565 
End bp5626653 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content62% 
IMG OID640896478 
Productperiplasmic binding protein 
Protein accessionYP_001434414 
Protein GI156744285 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4558] ABC-type hemin transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.670262 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTCCA GCCATCATGA GCGCCACGCG GCGTCGGTCG GTCGGTTGCG TCGGATGCTT 
CTTGGGTTCA GCCTGGCTGT CGTGGCGTTG CTCGCTGCAT GCGGGACGCC GTCGCCTTCC
GGTGTGATTC CTCCCGCAGC CACGAGAGTT CCCGCCTCTT CATCGCTCCC AACACCTGCA
TCCACGAGTG TGCCGGGGAT CGTCGAGTCT GTTCCAGGCG AGGCAGAACC GCAACTTCCG
GCCACCGTCG TCGATTATCA GGGCGAGCAG GTGACGATTA CGTCCATCGA ACGGATCGTC
AGTCTGAACG GCGATGTGAC CGAAATCATT TTTGCGCTTG GGATGGGGGA TCATGTCGTC
GGCGTCGATA GCAGTGCCAC ATTTCCTCCC GAACGCACCA AAATGCTGCC AAACATCGGC
TATCAACGGC GATTGAGCGC CGAAGGAATC CTGGCGCTCA ATCCGACGCT GGTGATCGGC
GATGAGGCGG CCGGTCCGCC CGAAACGCTG GCGCAGATCC GCACCGCAGG CGTGCCGGTG
GCGATCACTG CCGATCCGCC AACGCTCGAT GCACCGGTGC AGAAAATTCG GTTTGTCGCG
CAGGCGCTCG GCATTCCGCA GCGCGGCGAA CGCCTTGCCG CGCAGGTCGA AGCCGAGATC
GCGCGCGCGC GCGACCTGGC GAGTCGAATA ACGAACCCGC CGCATGTCCT CTTTCTCTAT
CTGCGCGGCA CGGATGTTCA GCAGGTCGCC GGCAGTAAAA CGCCGGTCAA TGTGATGATC
ACTGCCGCCG GCGGACTCAA TGCAGGTGCG GAAGCCGGGA TTGTGGAGTT CAAACCGTTG
AGTCCCGAAG TGGTCATTGC TGCGCAACCC GATGTGATTC TAGTGCTGGA AAAAGGGCTG
GAGTCAGTTG GCGGCGTCGA TGGTCTGCTG ACCATCCCCG GTCTCGCTGA CACGCCGGCC
GGGAAACAGC GTCGGATCAT TGCATTCGAT GATCTCTACC TGCTCGGCAT GGGTCCGCGC
ACCGGCCAGG CGCTCGCCGA TCTCGCCATC GCATTGTATG AGACTTCATC ACAGGAGAAG
CATCCATGA
 
Protein sequence
MTSSHHERHA ASVGRLRRML LGFSLAVVAL LAACGTPSPS GVIPPAATRV PASSSLPTPA 
STSVPGIVES VPGEAEPQLP ATVVDYQGEQ VTITSIERIV SLNGDVTEII FALGMGDHVV
GVDSSATFPP ERTKMLPNIG YQRRLSAEGI LALNPTLVIG DEAAGPPETL AQIRTAGVPV
AITADPPTLD APVQKIRFVA QALGIPQRGE RLAAQVEAEI ARARDLASRI TNPPHVLFLY
LRGTDVQQVA GSKTPVNVMI TAAGGLNAGA EAGIVEFKPL SPEVVIAAQP DVILVLEKGL
ESVGGVDGLL TIPGLADTPA GKQRRIIAFD DLYLLGMGPR TGQALADLAI ALYETSSQEK
HP