Gene Rcas_3845 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3845 
Symbol 
ID5541349 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5026004 
End bp5027698 
Gene Length1695 bp 
Protein Length564 aa 
Translation table11 
GC content57% 
IMG OID640895955 
Productextracellular solute-binding protein 
Protein accessionYP_001433900 
Protein GI156743771 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.267838 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0712114 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCTGC GCGATTTATT TGTCCGTCCA TCATCGATTC TGATGGTCCT GCTGATCGCT 
GGCTCGTTGA TTCTGGCTGC ATGCGGCGCA ACGCCGCAGC AGAGCGCTCC TGTTCCCACT
CAGGCGGGCG CCGCACCGCC GCCGACATCG GTTCCTGCCA CCCCTGCCGA CGCCAGACCA
CAGGCGACCG GTCGGACGCG CCTGGTTTTC TTCGGCAATC TGGAACCAGC CAGTATCGAT
GTCATCACAA TGTCGGGCAA CGTCAATGAA CGTCAGGTCG CCGCACAGAT TTTCGAAACG
CTGGTGTACA TGGATCAGAG CCAGCAGATA TACCCCGGTC TGGCGCGCGA ATGGAGCCGT
TCCGATGATG CGACGACGTG GACGTTCAAG TTGGTTTCGG GAGTAAAATG GCACGACGGC
ACGCCGTTCA CGGCTCAATC GGTGGCGGAC TATTTCGACT ACATTCGCGA TAATCCCGTC
GGAAGCGGAC CTGGATCGCT CAAAGCCGTA ATCGGCAGCG CCAAAGCAGT CGATGAAACG
ACGGTCGAAC TCACGCTCAC TGCGCCGCGT CCCGATTTGC TGATTGAGCT TGCGGATCCG
GGGATGGGCA TTGGAAATGC CGCCTACCTG AAACGGGTGG GCGCCGATGC CGGTTTCAAT
CCGGTGGGAA CCGGTCCATT CAAGTTCAAG GAATGGGTGC GCGGCAGCCA GATCGTGCTG
GAGCGCAACC CTGAATGGAC CTGGGGGTCA CCCTTGTTCA AGATGAGCGG TCCTCCTTTG
ATCGAAGAAG TGGTCTTTCG CTTTTCTTCT GAAGCGCAAA CACGACTTGC AGCGCTGGAA
GCCGGCGAAG TCGATTTCGT TGACCTTCTG CCATTCCAGG ATGTCGTGCG TGTCCGCACC
GATCCGCGCT TTACCGTCAC CGGCGTGCTG CTACCGGGCA TGCCGCAAAT GAACTATCTC
AACACCAGCC TGGCGCCGAC CGATGACATA AATGTGCGCA AAGCGATTAT CTATGCCACG
GACAAGCAAG GTATCATCGA GAGCGTTTAT TTCAACATGG TCGAACCGGC ATACGGACCG
CTGTCGCGCG TCTTCCCAGA GTACGAACCG GCGCTGGAGC AGATGTACGA GTACAATCCT
GAGAAAGCCG CGCAATTGCT CGAAGAAGCC GGCTGGCTTC CCGGTCCCGA CGGCGTTCGT
GTCAAGGACG GTCGGCGCCT TGAAGTGACG ATCGTTGAGA ATAAAGGCTG GAACGATTGG
GTCTATGTGC TGCAAGCCAA TCTCCAGGCT ATCGGCTTTG ACGCAAAAGT GCTCACCACC
CAGGGCCCGT CGAATACGGA AGCGATTGCC AGCGGCAAAT ACCACGTTCC GGCTATGGGA
GACGTTTTCG CTTCTGCCAG CCAGATGACC CGTGACTGGC ACTCAGAGGG GTACGGAACC
TTTCCTTCCG GTCATTTTCT GAAAGGCGAA GACGGCGCCA GATTGGACAG GATGCTGAAA
GAGGCGGAGA CGGAGATCGA CCCGGAAAAA CGCATCGAAA AGTATCGTGA GATTCAGAAG
TTCATCATGG AGCAAGCGTT GATGGTGCCG ATCTTCGAAC TGTACTTCTA TGTCGCCCAC
GCGAACAATC TGAAAGGCTT CGTCGTCGAT GGCACGGGTT TCTACAAGTA CTTTGCGCCA
GCGTACTTTG AGTAA
 
Protein sequence
MALRDLFVRP SSILMVLLIA GSLILAACGA TPQQSAPVPT QAGAAPPPTS VPATPADARP 
QATGRTRLVF FGNLEPASID VITMSGNVNE RQVAAQIFET LVYMDQSQQI YPGLAREWSR
SDDATTWTFK LVSGVKWHDG TPFTAQSVAD YFDYIRDNPV GSGPGSLKAV IGSAKAVDET
TVELTLTAPR PDLLIELADP GMGIGNAAYL KRVGADAGFN PVGTGPFKFK EWVRGSQIVL
ERNPEWTWGS PLFKMSGPPL IEEVVFRFSS EAQTRLAALE AGEVDFVDLL PFQDVVRVRT
DPRFTVTGVL LPGMPQMNYL NTSLAPTDDI NVRKAIIYAT DKQGIIESVY FNMVEPAYGP
LSRVFPEYEP ALEQMYEYNP EKAAQLLEEA GWLPGPDGVR VKDGRRLEVT IVENKGWNDW
VYVLQANLQA IGFDAKVLTT QGPSNTEAIA SGKYHVPAMG DVFASASQMT RDWHSEGYGT
FPSGHFLKGE DGARLDRMLK EAETEIDPEK RIEKYREIQK FIMEQALMVP IFELYFYVAH
ANNLKGFVVD GTGFYKYFAP AYFE