Gene Rcas_0164 details


Gene Information

Locus tag: Rcas_0164
Symbol:
ID: 5537625
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 199681
End bp: 200991
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 62%
IMG OID: 640892328
Product: ABC transporter related
Protein accession: YP_001430316
Protein GI: 156740187
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1116] ABC-type nitrate/sulfonate/bicarbonate transport system, ATPase component
TIGRFAM ID:
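
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon; the function name `check_gene_lengths` is hypothetical and not part of any database tooling.

```python
def check_gene_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check coordinates against the reported gene and protein lengths.

    Assumes 1-based inclusive coordinates and a single terminal stop codon.
    """
    span = end_bp - start_bp + 1               # inclusive span on the replicon
    codons, remainder = divmod(gene_length_bp, 3)
    return {
        "span_matches_gene_length": span == gene_length_bp,
        "length_is_multiple_of_three": remainder == 0,
        # one codon is the stop codon and does not encode an amino acid
        "protein_length_matches": codons - 1 == protein_length_aa,
    }


# Values taken from the record for Rcas_0164
result = check_gene_lengths(199681, 200991, 1311, 436)
print(result)
```

With the values from this record, all three checks pass: 200991 - 199681 + 1 = 1311 bp, and 1311 / 3 - 1 = 436 aa.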


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCGACAT CCAACGGCGC AGTGCTGGTC GAACTGAAAA ATGTTGCGCA ACGCTACGGG 
CGCGGCGAGA AACGCTTCAC CGCTATTGAG CATATCAACC TGACGATTCG CGAGGGAGAG
TTCGTTGCGC TGCTCGGTCC GTCGGGTTGC GGTAAAAGCA CGCTGCTTCG CATTATCACC
GGCTTGAATC GCCCGAGTGA CGGCGAGGTG CGCTACCGCG GCAAGTCGCT GAACGGCGTC
AATCCGTATG CGACGATTGT CTTTCAGTCG TTTGCGCTTT TTCCGTGGTT AACCGTCGAA
GGGAATGTAA CTGTCGCCAT GCGGGCGCGT GGCATGCCGG CGGAACAGGC GCGCGCCCGC
GCCATTGAGT TGATCGATCT CGTTGGCCTC GACGGTTTCG AGCAGGCGTA TCCGCGTGAA
CTGTCGGGCG GCATGCGCCA GAAAGTCGGG ATTGCCCGCG CACTCGCAGC CAACCCGGAA
TTGCTCTGCC TCGATGAGCC GTTCAGTGCA CTCGATGTGC TGAGCGCCGA GACGCTGCGC
GGCGAAGTGC TCGAACTCTG GACCGGCGGC CAACTCCAGA TCCGCGCCGT GCTCATGGTG
ACCCACAACA TCGAAGAAGC GCTCTTCATG GCGGATCGGA TCGTCGTGAT GGACAAAGGT
CCAGGGCGGA TCATCGCCGA AGTGCCGGTG TCCCTGCCCC ATCCGCGCGA CCGCAAATCC
GAATCGTTCG TCGCGTTGAC CGACCGCGTC TACGGCATTC TCATGGGGCA GACGCAGCCG
GAGCACGTCG AGTTCGGCAC GGAACCGGGG CAGGCCGGGC GCACACGTGC GCTGCCAAAC
GCCGATGTCA CCGAACTCGC CGGTCTGCTC GAACATGTGA ACAGCACTCC CACCGAGCGC
GACGATATTT ACCAACTGGC AGAAGACCTG GGAATCGATT TCAACAACCT GCTGGCGCTG
GTTGAGGCGG CGGAATTGCT TGGCTTCGCC CGTGTCGAGA CGGGTGACCT GATCATTACG
CCGCTCGGCG AAACGTTTGC CGACGCCAGC ATTCTGGCGC GCAAAGAAAT CTTTGCGTCG
CGGTTACGGC GGTTGCCCTT CTTTCGCTGG ATGCTGCGCA TGCTGGAAGC GTCGGGTGAA
CGCTCACTGC GTCGGGAAGT GCTCCTCACT GCTCTGGAGC GCGACTTCCC GCCTGTTGAA
GCGGAACAGC AACTCGATAT CGCCAGCAAA TGGGGGCGCT ACGCCGAACT CTTCGGCTAC
GACGACGCTC AGGGGCGCTT CTTCCTCGAA GAGGTCGCTA CCCCCGCATA A
 
Protein sequence
MATSNGAVLV ELKNVAQRYG RGEKRFTAIE HINLTIREGE FVALLGPSGC GKSTLLRIIT 
GLNRPSDGEV RYRGKSLNGV NPYATIVFQS FALFPWLTVE GNVTVAMRAR GMPAEQARAR
AIELIDLVGL DGFEQAYPRE LSGGMRQKVG IARALAANPE LLCLDEPFSA LDVLSAETLR
GEVLELWTGG QLQIRAVLMV THNIEEALFM ADRIVVMDKG PGRIIAEVPV SLPHPRDRKS
ESFVALTDRV YGILMGQTQP EHVEFGTEPG QAGRTRALPN ADVTELAGLL EHVNSTPTER
DDIYQLAEDL GIDFNNLLAL VEAAELLGFA RVETGDLIIT PLGETFADAS ILARKEIFAS
RLRRLPFFRW MLRMLEASGE RSLRREVLLT ALERDFPPVE AEQQLDIASK WGRYAELFGY
DDAQGRFFLE EVATPA
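
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a sketch under stated assumptions, not the pipeline that generated the record: the codon-to-amino-acid mapping is the standard one (which translation table 11 shares), the first codon is translated as methionine when it is one of a small set of bacterial initiator codons (only a subset of the table 11 start codons is listed here), and the names `translate`, `gc_content`, and `clean` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11)
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: a subset of the initiator codons allowed by translation table 11
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(seq):
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above
first_line = "GTGGCGACAT CCAACGGCGC AGTGCTGGTC GAACTGAAAA ATGTTGCGCA ACGCTACGGG"
print(translate(first_line))  # MATSNGAVLVELKNVAQRYG
```

The printed peptide matches the first 20 residues of the protein sequence, including the GTG start codon read as M. Passing the full gene sequence to `gc_content` gives a value that can be compared with the reported GC content.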