Gene Rcas_4124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4124 
Symbol 
ID5541635 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5337059 
End bp5338639 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content64% 
IMG OID640896236 
ProductABC transporter related 
Protein accessionYP_001434174 
Protein GI156744045 
COG category[R] General function prediction only 
COG ID[COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase 
TIGRFAM ID[TIGR01166] cobalt transport protein ATP-binding subunit 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000316959 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGACCAT TGGTTGTCTG CGATGAGGCG ACCTATCGGT ATCGAGACGG AACGCTCGCC 
CTGGATCGTG TGTCGCTTGC CATCGAAACG GGAGAGTTCG TGGTGCTGGC AGGCGCGAGC
GGGTCGGGCA AATCCACCCT CTGTCGCCTG CTCAACGGTC TCATTCCGCA CCTCCACGGC
GGCGATCTGA CCGGGCGCGT CCTGATTGCG GGGCAGGATG TTCGCTCGAC GCCACCCTAT
GCTCTGAGCC GCAGTGTGGG GCTGGCGTTG CAGAACCCCG AAGCGCAGAG CCTGGCAACA
ACTGTTGCCC GCGATCTGGC GCTCGGTCCT GCATGCCACG GGCTTGACCG CGCGACGATT
GCTGCGCGCG TCCGCGAGGT TGCTGCGCTA TTAAGGATTG AACCGCTCCT TGATCGCCAA
CCGGTCACAC TGTCGGGAGG GGAATTGCAG CGGGTAGCCA TTGCCGGAGT GCTGGCGCTC
CATCCACAGG TGCTGGCGCT CGATGAGCCG TTTGCGTTTC TCGACGCCGC CGGCGCTATG
CGGTTGCGCG AGACATTGCG CATGCTCCAT GAACGGGGGG TCGCCATTAT CGTTGCCGAA
CACCGTCTGG CAGACGTAGC AGACCTGGCG ACGCGCCTGA TCGTCCTTCA CGAAGGTCGG
ATCGTGGCAG ATGGCGCGCC GCGAACGGTG CTGGCAGGCG ATGTGTCACA GTGGGGAGTA
GAAGCGCCAC CATGGGCGCG CCTGGCGCAT GTTGCCGGGA TAGATGCGAT GTCGCCGACG
CTTGATGGAG CGCTTGATCT GGCGTCGCCG AACGGCAGTG CGCTCCATTC GTCACCGCAC
AATGCACCGG TTACACCGCC AGCGCTTAAC TGGGATGACG TATCATTTGC GCGCAACGAC
AGGATAGTGC TGCATCAGGC GTACCTGTCG GCAGCGGCGG GGGAAATTGT TGGGGTGCTG
GGCGCGAATG GCGCGGGTAA AACGACGCTG CTCAAGCTGG GCAACGGGTT GCTTCGGCCA
CAACGCGGAA CGGTGCGTGT TCAGGGACAG GCAATCGGGC AACGTCCGCT CTGGGAGGTC
GCGCGCAGTG TCGGTCTGGT GGGACAACAG CCGGGACATA TGCTGTTTGC GCCGACGGTG
CAGGACGAAC TGGAGGCGGG ACCACGGGCG CTCCGGCGAG TTGATCGCGC CTGGATCGGC
CACGTGATCG AGCAGTGCCG TCTGGAACCG TTGCTCCACC GTTCGCCACA TCATTTGAGC
GCGGGCGAAC AGCGGCGCGT GGCAATCGGG GCTGTGCTGG CGTCGCAACC ATCCGCGCTG
CTCCTCGACG AGCCGACATC CGGGCAGGAT GCGCTCAACC GAAAGGCTTT GCAAAAGATC
ATTGGCGACA TCGCCCGCGA TGGAATGGCG GTCGTTATCG CCACTCACGA CACTGAATGG
GCATATGCGC TCTGCACCCG CTGGGCAGTG CTGGATGCCG GCAGGATCAT TGCGAGCGGC
GCGCCATCGG CAATATGCGC GCAACCGGCA ATCGTCGCGC AGGCGCGCCT TCGCCTCCCA
ATGGCAGAGG CGATCCGGTA G
 
Protein sequence
MRPLVVCDEA TYRYRDGTLA LDRVSLAIET GEFVVLAGAS GSGKSTLCRL LNGLIPHLHG 
GDLTGRVLIA GQDVRSTPPY ALSRSVGLAL QNPEAQSLAT TVARDLALGP ACHGLDRATI
AARVREVAAL LRIEPLLDRQ PVTLSGGELQ RVAIAGVLAL HPQVLALDEP FAFLDAAGAM
RLRETLRMLH ERGVAIIVAE HRLADVADLA TRLIVLHEGR IVADGAPRTV LAGDVSQWGV
EAPPWARLAH VAGIDAMSPT LDGALDLASP NGSALHSSPH NAPVTPPALN WDDVSFARND
RIVLHQAYLS AAAGEIVGVL GANGAGKTTL LKLGNGLLRP QRGTVRVQGQ AIGQRPLWEV
ARSVGLVGQQ PGHMLFAPTV QDELEAGPRA LRRVDRAWIG HVIEQCRLEP LLHRSPHHLS
AGEQRRVAIG AVLASQPSAL LLDEPTSGQD ALNRKALQKI IGDIARDGMA VVIATHDTEW
AYALCTRWAV LDAGRIIASG APSAICAQPA IVAQARLRLP MAEAIR