Gene Rcas_2837 details


Gene Information

Locus tag: Rcas_2837
Symbol:
ID: 5540326
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 3677271
End bp: 3678461
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 57%
IMG OID: 640894966
Product: extracellular solute-binding protein
Protein accession: YP_001432926
Protein GI: 156742797
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0687] Spermidine/putrescine-binding periplasmic protein
TIGRFAM ID:
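
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene ends with a single stop codon that is not counted in the protein length; neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 3677271
END_BP = 3678461
GENE_LENGTH_BP = 1191
PROTEIN_LENGTH_AA = 396


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Amino-acid count, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: 3678461 - 3677271 + 1 = 1191 bp, and 1191 / 3 - 1 = 396 aa.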


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000153831
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCAAAG GACGCACCAG GTTCATTGCG CTGTTGCTCG CCGTGCTGCT GACACTCGCA 
GCATGCGGCG GGCAGCCGAC CGGCAGCCCC GGCAATGAAT ACGGCAGCGG CGGCGCGACA
ACCGAGCCGA CGACTGCTCC TCCGGCGCAA CCGTCTACAG GTGACGAGTT GCAGGTTGAT
CGCTCGCGGC TCTCGAGTGA ACTCAGATTC TTCAACTGGA CGGATTATGT CGATCCTTCA
ATCCTGGAAG ATTTTGAGAA AGAGTATGGC GTCAAAGTGA TTGTTGACCT CTTCGACGCT
AACGAAGATA TGCTCGCCAA GGTGCGCGCC GGGCGCTCCG GGTACGACAT TGTGACGCCA
TCGGACTATG CGGTTGAGAT CATGTGGCGT GACGGACTCA TCGCAAAACT TGACAAGTCG
CTGCTGCCCA ATTTGAAGAA CATCGATCCC GACCTGCTCA ATAAATATTT CGATCCGGGG
AATGTCTATT CTGTGCCGTA CATGTACGGC ATTACTGGAA TCGCCTACAA CCGGCAATCC
TTCCCCAACG GCGTCGAGAG TTGGGCGGTG CTGTTCGACA CGGCGGAGAT CGCGCGCTAT
CGCGGTCAGT TCAGCATGCT CGACGATGAA CGCGAAACAC CCGGCGCGGC GCTGAAATTC
CTGGGCTACT CACTGAATGA AACCAGCCCG GAGGCGCTGA AAAAAGCGCA GGACCTGTTG
ATCGCCCAGA AACCGTTCCT GGCCGGGTAC AACAGCAGTG ATGTCAACCG GAAACTGGCG
AGCGGCGAAT ATGTGATTGC GCATGCCTGG AGCGGTTCGG CATTGCAGGC GCGCAACGGC
TTGGGCGACG AGTTCTCCGG CAACCCGGAT ATCGCCTTTG TTATTCCAAA GGAAGGCGGC
ATGATCTGGA TGGACAATAT GGTCATCCTG GCCGACTCGC CGAACGCTTA TACGGCGCAT
GTGTTCATGA ACTTCCTGAT GCGCCCTGAT ATTGCTGCGC GCAACGCCGA ATACATTGGC
TATCTCTCGC CGAACGTCGA GGCGATCAAA CTGCTGCCGC AGGAGATTAT CGACCTGTAT
AACGAAGGGT TTGCCCCGAA CGATGAGGTT CTGAAACGGT TGGAATGGGC AATACGCAAT
GATCAGACTG CCGCCTTCAC CGACCTGTGG ACGGCGGTGA AAGGGGAGTA G
 
Protein sequence
MLKGRTRFIA LLLAVLLTLA ACGGQPTGSP GNEYGSGGAT TEPTTAPPAQ PSTGDELQVD 
RSRLSSELRF FNWTDYVDPS ILEDFEKEYG VKVIVDLFDA NEDMLAKVRA GRSGYDIVTP
SDYAVEIMWR DGLIAKLDKS LLPNLKNIDP DLLNKYFDPG NVYSVPYMYG ITGIAYNRQS
FPNGVESWAV LFDTAEIARY RGQFSMLDDE RETPGAALKF LGYSLNETSP EALKKAQDLL
IAQKPFLAGY NSSDVNRKLA SGEYVIAHAW SGSALQARNG LGDEFSGNPD IAFVIPKEGG
MIWMDNMVIL ADSPNAYTAH VFMNFLMRPD IAARNAEYIG YLSPNVEAIK LLPQEIIDLY
NEGFAPNDEV LKRLEWAIRN DQTAAFTDLW TAVKGE
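
The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. It is a minimal example, not the pipeline that produced this record: the `translate` helper is hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as starts, which does not matter here because the gene begins with ATG).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# shared by translation table 11); '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGCTCAAAG GACGCACCAG GTTCATTGCG"))  # MLKGRTRFIA
```

The first ten codons give MLKGRTRFIA, which matches the start of the listed protein sequence.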