Gene Rcas_1454 details


Gene Information

Locus tag: Rcas_1454
Symbol:
ID: 5538928
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 1856216
End bp: 1857385
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 60%
IMG OID: 640893592
Product: extracellular solute-binding protein
Protein accession: YP_001431567
Protein GI: 156741438
COG category: [E] Amino acid transport and metabolism; [T] Signal transduction mechanisms
COG ID: [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain
TIGRFAM ID: [TIGR01096] lysine-arginine-ornithine-binding periplasmic protein
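
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1856216
end_bp = 1857385

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1170 389
```

Under these assumptions the computed values agree with the reported 1170 bp and 389 aa.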


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.758875
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGCG TGCTCGGATG GCTGATCGTC ATTCCGCTCC TGCTGGCTGC GTGTGGTCAA 
CCGGCAGCCC AGACGGGGCA ACCGGTGGAA GTGACCCGCA TCGTCGAAGT GACTCGCGTT
GTCGAAGTGA CGCCGGTTGG CGGCGGCGCG CCTGCACAAC CGGCAGCGAC TCCCGCGCCT
GCTCCGGCGT CTGCCGCTCC TGCTGGTTTC GGTGAGACGC TCAAGGCGAT TCAGGCGCGC
GGCAAACTGA TCTGCGGCGT CAACAGCCAG GTTCCCGGTT TTGGGTTCGT CGATCCGACA
GGCGCTTTCA GCGGTTTCGA TATCGATTAC TGCAAGGCGC TGGCAGCAGC CATTTTCAAT
GATGTCAGCA AGATCGAATA TCGTCCACTG ACTGCGGAGC AGCGCTTTGC CGCCTTGCAG
AGCGGCGAGA TCGATGTGCT CATCCGCAAC ACGACCTGGA CGCTCACCCG TGATACCGAC
AACGGCGGCA ATTTCGTCGC CACGACGTTC TACGACGGTC AGGGTATTAT GGTCCCGAAA
GCCTCGAATA TTACGAAGCT CGAAGACTTG AACGGCGCGA CGATCTGTGT GCAGAAGGGC
ACGACGACCG AGTTGAACCT GGCGGACCAG ATGAACGCGC GTAACCTGGC GTATACCCCC
GCAACGTTCG AGGACGCCAA CAGCACCTTC GCCGCCTACG CCGAGGAGCG TTGCGATGCC
GTGACGACCG ATAAGTCCGG TCTCGTGTCG CGCCGCTCAG TGCTGCCCAA CCCCGATGAT
CACGTCATCC TCGATGTGAC GCTGTCGAAG GAGCCGCTTG GTCCGATGGT GCGCCAGGGC
GACGACCAGT GGTTCGACAT TGTGCAGTGG GCGGTATTTG CCACGTTTGC CGCTGAAGAG
TTCGGTATCA CATCGCAAAA TGTGGATCAA CTCAAAGAAA CCGATACCCG TCCCGAAGTT
CGGCGCTTGC TTGGCGCTGA TCCAAATGTG GACCTCGGTG CGAAGCTGGG CTTGAGTAAG
GATTGGGCTG CCAATGTCAT CAAAGCGGTC GGCAATTATG GCGAGATCTA CGACCGCAAT
CTCGGCCCGA ATACGAAGAC TGCCATTCCA CGTGGCATCA ACAATCTTTA CACTCAGGGC
GGATTGCTCT ACGCGCCGCC GTTCCGGTAA
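
The record reports a GC content of 60% for this gene. As a rough illustration of how such a figure can be derived from a sequence in the 10-base block layout used above, here is a small Python sketch. The function name `gc_content` is hypothetical, and the example runs on the first line of the sequence only, so its result is not expected to equal the whole-gene value.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence above, kept in its block layout.
first_line = "ATGAAACGCG TGCTCGGATG GCTGATCGTC ATTCCGCTCC TGCTGGCTGC GTGTGGTCAA"
print(f"{gc_content(first_line):.1%}")
```

Passing the full 1170 bp sequence as a single string would give the whole-gene fraction.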
 
Protein sequence
MKRVLGWLIV IPLLLAACGQ PAAQTGQPVE VTRIVEVTRV VEVTPVGGGA PAQPAATPAP 
APASAAPAGF GETLKAIQAR GKLICGVNSQ VPGFGFVDPT GAFSGFDIDY CKALAAAIFN
DVSKIEYRPL TAEQRFAALQ SGEIDVLIRN TTWTLTRDTD NGGNFVATTF YDGQGIMVPK
ASNITKLEDL NGATICVQKG TTTELNLADQ MNARNLAYTP ATFEDANSTF AAYAEERCDA
VTTDKSGLVS RRSVLPNPDD HVILDVTLSK EPLGPMVRQG DDQWFDIVQW AVFATFAAEE
FGITSQNVDQ LKETDTRPEV RRLLGADPNV DLGAKLGLSK DWAANVIKAV GNYGEIYDRN
LGPNTKTAIP RGINNLYTQG GLLYAPPFR
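
The protein sequence corresponds to the gene sequence translated with translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code and differs in the start codons it permits. The sketch below is a minimal Python illustration under that assumption. The names `CODON_TABLE` and `translate` are hypothetical, and alternative start codons are not handled (this gene begins with ATG, so that does not matter here).

```python
from itertools import product

# Codon assignments in TCAG order; table 11 shares these with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA to protein, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence above.
print(translate("ATGAAACGCG TGCTCGGATG GCTGATCGTC ATTCCGCTCC TGCTGGCTGC GTGTGGTCAA"))
print(translate("GGATTGCTCT ACGCGCCGCC GTTCCGGTAA"))
```

The two printed fragments can be compared with the first 20 and last 9 residues of the protein sequence shown above.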