Gene Rcas_2424 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2424 
Symbol 
ID5539905 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3117599 
End bp3119785 
Gene Length2187 bp 
Protein Length728 aa 
Translation table11 
GC content62% 
IMG OID640894554 
Productextracellular solute-binding protein 
Protein accessionYP_001432522 
Protein GI156742393 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.935396 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCAACG CTCGAAAGCT CAACCGGCGC ACATTCCTGC GCCTGTCCGC TGTGACGGCG 
GCAAGCGCAG CGATTGCCGC CTGTGGCGGC GGCGGTCAGC CTGCCACCGC ACCGACGACA
GCGCCCGCCG CGCCTCAACC GACCACAGCG CCAGCGGCGC CCGCTCCAAC CACTCCTCCA
GCCGCAACAG CTGTTCCCCC GGTGACGACC CAATACAAAG AAGCGCCGAT GCTGGCAAAA
CTGGTGCAGG AAGGTAAACT GCCGCCGGTG GACGAGCGCC TGCCGAAGAA CCCCTACACT
CCGCCACACT CCTGGCTCAC AGTTGGCAAG TACGGCGGCG TGCTGAAGAA GACCTACAAC
AACAACTGGG GTATCACCGG CTTCATCCAC GAGATGCAGT ATGGCTCGTC GCCGCTGCGC
TGGCTTAAGG ATGGTCTGGC GATTGGTCCA GGCTTCGTCG AAAGCTGGGA ATCGAATGCC
GACGCGAGCA AGTGGACGTT CAAGATCCGC GAAGGGATCA AGTGGAGTGA CGGTCAACCC
TTCACCACCA AAGACATTAT GTACTGGTGG GAGTACACGG TCGGCGGCAA CGGCAAGGAG
AAGGAGTACC CCGCCGGTCT CAAGCCGATC AACAGCCCGC CTGACGAGGC GCGCTCCGGC
ACCGGCACCC TGATGACACT CAATGCGCCG GATGATTACA CCTTCGAGAT GGTGTTCGAC
GCGCCGGCGC CGCTCACGGC GGACCGTCTG GCAATGTGGG TCAACATGTT CATCGGTCCG
GCCTGGGTTA TGCCACGTCA CTACATGGAG CAGTTCAACC CGGTGCTCAA CCCCGATAAG
TATAAGGACT GGGAAGAGCA TCAGCGCAAG TTCAACCACA ACAACCCCGA CTGCCCGCGC
CTGACCGGTT GGAAACTGGA TATCTTCGAG GAGGGCGTCC GCGCTGTCTG GTCGCGCAAC
CCGTACTACT GGGCGGTCGA TAAGGAAGGC AACCAGTTGC CGTATATCGA CCAGATCATT
GTGACGGCGG TCAAGGACAA GGAGATCGAG AAACTGGCGT ACACCGAGGG GCGCGCCGAT
CATGCGCACT TCCACAGCCA GGGTCTGGCG GATGTGCAGT CGTTGCGCGA CGCCGAAGCC
AAGAGCGGTC TCGAGGTGCG CTTCTGGGAC TCGGGTTCGG GCACCGGTTC GCTCTACTTC
TTCAACATGG ACTTCAAGGA CCCGAAGATG CGCGCCGTGT TCCGCGATCC GAAGTTCCGC
CAGGCGCTGT CACACGCTTA CAATCGCGCC GATGTGCAGA AAGCGGTCTA TTTCGGGTTG
GGCGAACTGA CGACCGGCAC CTTCAGCCCC AAGGCCATCG AGTACAACAT CAACGATCAG
GGCAAGCAGG TGTATGCCGC CTGGCGCGAT AGCTACGTGA AGTACGATCC GGCGCTGGCT
GAACAGATTC TGGACGAAGC CGGCTACAAG AAGGGACCTG ATGGCAAGCG CACGATGCCG
GACGGCAGTC CGCTTCAGAT TCAGATCACC TATGGCGCCG ATCAGGCGCC CGGCGGTGAG
CACCTGTCGA AGAACGAGCG CCTGGCGCGC GACTGGCAGG CGATCGGGAT CGATGCGGTG
CTGACACCTA TTCCGGGTGA GGGCGCCGAC GAGAAGTGGC GCGCCGGCGA GTTGCCGATG
AAGACCACCT GGGAGGTCGG CGACGGTCCC AACCACCTGG TCTTCCCCTC CTGGCTGGTG
GCGGATGAGA CCGAGCGCTG GGCGCCGCTG CACGGTCGCG GGTATACGCT GCGCGGCACT
GCGTCGGAGA AGGAGGAACT GGATAAGAAC CCATGGGACC GCAACCCGCC GCGCATCAAT
CGTGGCGAGC CGGACTATAT GCCGGCGATT GGCAAGCTTC ACGAACTGTT CGACAAGAGC
AAGGTGGAGC CAGATGCGAT GAAGCGCCAC CAACTCGTGT GGGATATGAT CAAAGTCCAC
ATCGAGGAGG GTCCGTTCTT TACCGGGACG ATCGCAAACC CGCCGCGCAT CATTTTGGTG
AAGAAGGGGT TGATGAACGT GCCGACCCGC GATGACCTTT TGAAGGAAGG GTTGGGCGGT
TTCGTCAATC CGTGGATCAT CCCCTCTCCG GCGACCTATG ACCCGGAGAC CTGGTACTGG
GATAATCCTG AGGCGCATAC GGCGTAG
 
Protein sequence
MSNARKLNRR TFLRLSAVTA ASAAIAACGG GGQPATAPTT APAAPQPTTA PAAPAPTTPP 
AATAVPPVTT QYKEAPMLAK LVQEGKLPPV DERLPKNPYT PPHSWLTVGK YGGVLKKTYN
NNWGITGFIH EMQYGSSPLR WLKDGLAIGP GFVESWESNA DASKWTFKIR EGIKWSDGQP
FTTKDIMYWW EYTVGGNGKE KEYPAGLKPI NSPPDEARSG TGTLMTLNAP DDYTFEMVFD
APAPLTADRL AMWVNMFIGP AWVMPRHYME QFNPVLNPDK YKDWEEHQRK FNHNNPDCPR
LTGWKLDIFE EGVRAVWSRN PYYWAVDKEG NQLPYIDQII VTAVKDKEIE KLAYTEGRAD
HAHFHSQGLA DVQSLRDAEA KSGLEVRFWD SGSGTGSLYF FNMDFKDPKM RAVFRDPKFR
QALSHAYNRA DVQKAVYFGL GELTTGTFSP KAIEYNINDQ GKQVYAAWRD SYVKYDPALA
EQILDEAGYK KGPDGKRTMP DGSPLQIQIT YGADQAPGGE HLSKNERLAR DWQAIGIDAV
LTPIPGEGAD EKWRAGELPM KTTWEVGDGP NHLVFPSWLV ADETERWAPL HGRGYTLRGT
ASEKEELDKN PWDRNPPRIN RGEPDYMPAI GKLHELFDKS KVEPDAMKRH QLVWDMIKVH
IEEGPFFTGT IANPPRIILV KKGLMNVPTR DDLLKEGLGG FVNPWIIPSP ATYDPETWYW
DNPEAHTA