Gene Rcas_3842 details


Gene Information

Locus tag: Rcas_3842
Symbol:
ID: 5541346
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5021678
End bp: 5023384
Gene Length: 1707 bp
Protein Length: 568 aa
Translation table: 11
GC content: 59%
IMG OID: 640895952
Product: extracellular solute-binding protein
Protein accession: YP_001433897
Protein GI: 156743768
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
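
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
# Coordinates copied from the record above.
# Assumption: they are 1-based and inclusive on the replicon.
START_BP = 5021678
END_BP = 5023384


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len // 3 - 1


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp, protein_length(n_bp))  # prints: 1707 568
```

Both results agree with the Gene Length and Protein Length fields of the record.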


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0450891
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.108057
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGACC GATGCTCTGA TCAGGGAGAA CAAAAGGTGC GTCGAATCGA TCGGCGCGCG 
TTCCTGCGTC TTGTGGCGGC AGGCGGAGGT GCGCTGGCAT TGCAGGCTTG CGGTGGCGGC
GCTCCACCCG CTGCTGCGCC GACGACCGCC CCCGCTGCAC CGACGGTTGC CCCGGCTGCG
CCGACGGCTG CTCCTGCCGC GCCGACGGCT GCTCCTGCCG CGCCGACGGC TGCACCCGTT
GCATCGAACG CGCGCGGAGG CAAAGTCACC TGGGCGATGC TTGGTGATCC GGTGTCGCTC
GAACCATACG GAATCAACAT CACGGGCCAG TACAATTACG AAGCGCGCGA GCCGATGTAT
GACTCGCTGC TCGTGTGGGA TCGCGATCTG AAAGTACAGC CATCGCTGGC AGAGTCGTTC
GAGACGCCGG ACGATACGAC CTATATCTTC AATCTGCGTT CGGGCGTGAA ATTCCACAAC
GGCAGGGAAC TCACCGCCGA GGATGTCAAA TTCTCGCTCG ATACCATCAT CAATCCGCCT
GACGGACGCA ACCCCGGTGC AGCATTCTTC GCCAATTTCG ACACGATTGA GGTCGTTGAC
CCATTGACGA TTCGCATCAA TCTGAAGAAA ATCGACCCGA CCATTCCTGG GTTGTTCGCC
TGGTCGCGCT ATACAAACAT CTTTCCCACC GACATGCCGT CGCAGATCAA TCCCGTGACC
CAGGCGATCG GCACCGGTCC CTTCCGATTG GTCAGTTATA CGCCCAACGC CGAAATCGTC
TATGAGCGCT TCTCCGATCA CTGGAACCCC GAACAGCCGA ACATCGATCA ATTGATCTAT
CGCGTCATCC CTGAGGAAGA TGCCCGCATT GCGGCACTGC GCTCCGGCGA TATCGACGGG
ACGGACGTTA CGCCGTTAGG TGCGCGGCGA CTCCAGAACG ACTCCGACAT CACCATTCTG
AAAGGTCTCT ACTCGCAGCC GAAGGTGCTT CAGTTTACAC TGAAGGGCGG GAAACCGTGG
GACATCAAAG AAGTGCGCCA GGCGATCAGT CTGACCATCG ACCGCCAGGA ACTGATCGAC
AAGGTGATGG AAGGCGAGGC GGAACTGACG GGACCGGTCG TGCCCGGCTA TGGCGACTGG
CCCCTCAGCC AGGATGAACT GCGCGCCGCG TACCAGGTTG ATGTCGAAAA GGCGCGCCAG
TTAATGGCGC AAGCCGGTTA TGCCGACGGC TTCAAGGTGA CGGCGATGAC CTTCGCCAAC
TACTCGAACG ATAATGCCAT CATTGTACAG GAACAACTGC GCCAGTTGAA CATCGACATG
CAGATTGAAC AGATCGAGTT TGGCACATTC GCGCAGCGCG TCACCAATGG CGAGTTCGAG
TGGTGCTTTA CGGCGCGCGG CATGCGCGCC GACGTGAGCG GATACCTGAA CGACTTCCGT
CGTCTGGGCA TTGCCGAGAA GAACTGGTTT CCCGCCTGGG AGAACGCTGA GCTTAATGAG
GCGTATGATG CAGCAATGGC GACGTTCGAT CAGGCAAAAC GACGCGAATT GATGCAGAAA
GTGCAGCGCA TCGTCATTAA TGAAGCGCCA CACATCTATC TCTACCAGGA CTACCGTTTC
TCGGCGGTTC GGAAGCGGGT GCAGAATTAC TACGTCGCCT TCACGACGTT CCGCCCCGCG
CTGCGCGAAA TCTTTGTGAC CGCGTAA
 
Protein sequence
MNDRCSDQGE QKVRRIDRRA FLRLVAAGGG ALALQACGGG APPAAAPTTA PAAPTVAPAA 
PTAAPAAPTA APAAPTAAPV ASNARGGKVT WAMLGDPVSL EPYGINITGQ YNYEAREPMY
DSLLVWDRDL KVQPSLAESF ETPDDTTYIF NLRSGVKFHN GRELTAEDVK FSLDTIINPP
DGRNPGAAFF ANFDTIEVVD PLTIRINLKK IDPTIPGLFA WSRYTNIFPT DMPSQINPVT
QAIGTGPFRL VSYTPNAEIV YERFSDHWNP EQPNIDQLIY RVIPEEDARI AALRSGDIDG
TDVTPLGARR LQNDSDITIL KGLYSQPKVL QFTLKGGKPW DIKEVRQAIS LTIDRQELID
KVMEGEAELT GPVVPGYGDW PLSQDELRAA YQVDVEKARQ LMAQAGYADG FKVTAMTFAN
YSNDNAIIVQ EQLRQLNIDM QIEQIEFGTF AQRVTNGEFE WCFTARGMRA DVSGYLNDFR
RLGIAEKNWF PAWENAELNE AYDAAMATFD QAKRRELMQK VQRIVINEAP HIYLYQDYRF
SAVRKRVQNY YVAFTTFRPA LREIFVTA
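
The sequences above are printed in space-separated blocks of ten letters, so they need to be joined before any analysis. The sketch below is one way to do that, followed by a GC fraction calculation and a basic sanity check on the coding sequence. The helper names are hypothetical. The check accepts only ATG as a start codon, which is a simplification, since translation table 11 also permits alternative starts.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in blocks by removing all whitespace."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def looks_like_complete_cds(dna: str) -> bool:
    """Length is a multiple of 3, starts with ATG, ends with a stop codon.

    Simplification: only ATG is accepted as a start codon here.
    """
    stop_codons = ("TAA", "TAG", "TGA")
    return (
        len(dna) % 3 == 0
        and dna.startswith("ATG")
        and dna.endswith(stop_codons)
    )


if __name__ == "__main__":
    # First two blocks of the gene sequence above, used as a short demo.
    demo = clean_sequence("ATGAACGACC GATGCTCTGA")
    print(len(demo), gc_fraction(demo))  # prints: 20 0.5
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` gives a value to compare with the GC content field of the record.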