Gene Rcas_3171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3171 
Symbol 
ID5540669 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4116578 
End bp4117726 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content59% 
IMG OID640895292 
Productextracellular solute-binding protein 
Protein accessionYP_001433243 
Protein GI156743114 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCATA CGCCAGGACG CCTGACGTTT CGCCTGATAC TCCTTGCAAC GAGCCTTGTG 
TTGATCGCTG CCTGCGGCGG GCAACCTGCT GCCGCTCCGC CCACGACCGC GCCTGCGGAG
TCGACAGCAG CGCCAACCAC CGCACCGGTC GAGCCGACCG CTGCACCAAC CACTGCGGCT
GAGCAACCGA CCGGAGCGCC GGTCGCTGCG CCAAAGGGCG AAGTCATTGT GTACACCTCA
CGCGCCGAGG CGCTGTTCAA ACCGGTCATC GAAGCCTTCA ATGCCGTCTA TCCAGATGTT
AAGGTGACCG TCTTGAATGG CAGCAACAGC GAACTGGCAG CCCGAATTCT CGAAGAGCGC
GCCAATCCGC AGGCGGACGT CCTGATTAAC TCCGATATTC TGACAATGGA GAATCTGGCG
GCGGAAGGGG TCTTTGCTCC GAACGACTCG CCGGCAGTGA TGGCCGTGCC GGCTGATTAC
CGCGCCGATG ATGGCAGTTG GGTCGCTCTG ACGCTCCGCG CTCGCGTCAT CATGTACAAT
ACCGATCTGG TATCGCCCGA CGAACTGCCG AAGAAAATGC TCGATCTGGC TGACCCGAAG
TGGAAGGATG TTCTCGGTTC CGCCAATAGC ACGAACGGTG CGATGATGGC GCAACTGGTC
GTGATGCGCA ATCAACTGGG CGAAGCGGCG ACTGAGGCGT TCATTCAGGG GTTGCTGGAA
AACAATACGC AGTTCTTCGG CGGTCACACC GATGTGCGCA AGGCGGTCGG CGCCGGTGAA
TTGAAACTGG GGCTGGTCAA CCACTACTAC TACCATCTCT CCAAAGCGGA AGGCGCGCCG
GTAGGTGTGA TCTACCCCGA TCAGGAAGAT GGCGGTCTGG GGCTAGTGGT CAACTCGACC
AACGCGGGGA TTATAAAGGG TGGACCAAAC CCGGAGATAG CAAAGATCTT TGTGGACTTC
ATGCTCTCGC CGGATGGTCA GAAGATCTAC GCCGAGCGCA ACTACGAGTA TCCGATTGTT
CCGGGCATTC CGCTGGCGGA GGGCGTTGCG CCGCTCAGTT CGTTCAAACT CAACCCATTC
CCGCTCAAGA CCTTGCGCGA TGAATTGGAA CCGACACGGG CGCTGGTTCA GAAAGTCGGT
ATGCCATAG
 
Protein sequence
MNHTPGRLTF RLILLATSLV LIAACGGQPA AAPPTTAPAE STAAPTTAPV EPTAAPTTAA 
EQPTGAPVAA PKGEVIVYTS RAEALFKPVI EAFNAVYPDV KVTVLNGSNS ELAARILEER
ANPQADVLIN SDILTMENLA AEGVFAPNDS PAVMAVPADY RADDGSWVAL TLRARVIMYN
TDLVSPDELP KKMLDLADPK WKDVLGSANS TNGAMMAQLV VMRNQLGEAA TEAFIQGLLE
NNTQFFGGHT DVRKAVGAGE LKLGLVNHYY YHLSKAEGAP VGVIYPDQED GGLGLVVNST
NAGIIKGGPN PEIAKIFVDF MLSPDGQKIY AERNYEYPIV PGIPLAEGVA PLSSFKLNPF
PLKTLRDELE PTRALVQKVG MP