Gene RoseRS_0413 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_0413 
Symbol 
ID5207349 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp527559 
End bp529520 
Gene Length1962 bp 
Protein Length653 aa 
Translation table11 
GC content61% 
IMG OID640594039 
ProductABC-type dipeptide transport system periplasmic component-like protein 
Protein accessionYP_001274794 
Protein GI148654589 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.457116 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATTC TCGACAGGAA GATCGCGCTG ACCCTGTTGC TCGCGCTGGT TGCGCCGATC 
CTGGCAGCCT GCGGCGGCGG GACGGCACAG CAACCGATCC GCGAAACGGT GGTCGTAACG
GCAGAGCCGG TTCGGGAGAC GGTCGTTGTG CGCGAGACGG TTGTGGCTGA GGCGCCGACG
GCTGCTCCAT CAGGGGCGTT CACCACACCG CACCCGATCC TCAGCGATGT GCGGGTGCGC
CAGGCGATTG CGTACTGCAC CAACCGCCCC GAGTTGATCC AGTCGGTCTA CAGTTACCTG
ACGCCGGAGC AGCAGCAAGA GTTGCTGATG GATACCAACC TGCCGAAGAC GCACTGGGCG
GCAGCCAGCG AAGCGGACGG GATCGTGGTC TATCCGTTCG ACCCTGATAA GGGCAAGGCG
CTGCTCGAAG AGGCGGGCTG GAAACTGCCT GAAGGTCAGC GTGTGCGCGC GAAGGATGGC
GAGCCGCTGT CGCTCGAGTT CACCACCACC AATGCGCAGT TCCGCATCAC CTGGGCAACC
GTGCTGGAGA AGCAGTTGCT CGATAATTGC GGCATCCAGA TCATCCGCAA GCATGCCCCG
GCATCCTGGT GGTTCGGTGG CGCGTCCGGT CTGCGCCGGC GCGACTTTGA ACTTGGCGCA
TTCGCCTGGG TTGGTGAAGC CGATCCCGGT GGTCGCACCC TGTATGCCTG TGACCAGATT
CCGCTGCCGG ATAACAACTG GAATGGGCAG AACTATATGG GCTGGTGCAA CGAGACGGCC
AGCAAGGCGA TCACTGCGGC GAACAATACG CTGAACCGCG AGGAGCGCAT CAAGAACTAC
AAGGCGTTCC AGGTCGAGTT CACCAAGGAC ATGGTCAGCC TGCCGCTGTT CCAGCGCCTC
GAGGCGTATG CCTGGAACAA GGCGTTGAGG GGTCTCAAGC CCGACCCGAC CGAGTATATC
ACCGCGAATG CGTATCAGTG GGAGCGCGCC GATGGCGGCG ACACCATTAT CCTGGGCTTC
ACTCAGGAGC CAGCTTCGAT GTTCACGCTG GTCGAGAGCG CGGCAGTGCA GCGCCAGGCG
GCGCAACTCG TCGAAGGCGT GCTGACCACC CAGTATAGCT ACGACTTCCA GCCGGTGCTG
CAGGATGGTC TTGCGACGAT TGAGTCCGGC AAGGCAAAGA ATGAGGTCGT CGAAGTCAAA
GAAGGTGATA AGGTCTGGGA CGCGACGGGC GCTGCCGTTG AACTCAAGCC GGGCGTTGAG
ATCATCAACT CCGACGGTGA GACGGTCAAG TACGAGAGTG GCACGGTGAA GATGAACCAG
CTCACCGTCA CCTACGATCT CATCAAAGGC ATTAAGTGGT CGGACGGCGA ACCGCTGAAG
AAGGCCGACC TGGAACTGGG CGTCAAGATC GCGTGCGATC CCGACTCAGG GGCGGTCAGC
CTGACCTTCT GCGAGTCGCA CGATAATCTC AACGGTGTCA CCTTTAACAG CGATACCAGC
TACACCATCA AGTTCCTGCC GGGCGTGCAG TGGCCAATTT ACTTCACTGC TCCCTACGGC
GGCTACCCCT CGCATGTGAC CGTTTCGGAC GGGCGGAAAC TGGCGGATGT GCCGGCGAAA
GAGTGGGCAA CTCTGCCGGA AGTCGCCGAA ATTCCGCTGG GGTATGGTCC ATACATCCTG
AAGGAGTGGA AGAAGGGCGA GTTCATGAAG TTCGAGGCAA ACCCGAACTT CGTGCTCGGC
GCGCCGAAGG TCAAGAATGT CATCATCCAG TTCTACGCCG ACACCAATGC GGCAGTAGCA
GCGCTGCTGA CCGGCGAGGT CGACATTCTT GAGAAGGCGA CGCTCGGCGC CGGTCCGGAG
GTGGAAACGG TGCTCAAGGC GGCGGCAGAG GGCAAGATCG AAGCCAAGAC CGACGCCAGC
CCGACCTGGG AGCACATGGA TATGAACCTG TTCATCCGGT AA
 
Protein sequence
MKILDRKIAL TLLLALVAPI LAACGGGTAQ QPIRETVVVT AEPVRETVVV RETVVAEAPT 
AAPSGAFTTP HPILSDVRVR QAIAYCTNRP ELIQSVYSYL TPEQQQELLM DTNLPKTHWA
AASEADGIVV YPFDPDKGKA LLEEAGWKLP EGQRVRAKDG EPLSLEFTTT NAQFRITWAT
VLEKQLLDNC GIQIIRKHAP ASWWFGGASG LRRRDFELGA FAWVGEADPG GRTLYACDQI
PLPDNNWNGQ NYMGWCNETA SKAITAANNT LNREERIKNY KAFQVEFTKD MVSLPLFQRL
EAYAWNKALR GLKPDPTEYI TANAYQWERA DGGDTIILGF TQEPASMFTL VESAAVQRQA
AQLVEGVLTT QYSYDFQPVL QDGLATIESG KAKNEVVEVK EGDKVWDATG AAVELKPGVE
IINSDGETVK YESGTVKMNQ LTVTYDLIKG IKWSDGEPLK KADLELGVKI ACDPDSGAVS
LTFCESHDNL NGVTFNSDTS YTIKFLPGVQ WPIYFTAPYG GYPSHVTVSD GRKLADVPAK
EWATLPEVAE IPLGYGPYIL KEWKKGEFMK FEANPNFVLG APKVKNVIIQ FYADTNAAVA
ALLTGEVDIL EKATLGAGPE VETVLKAAAE GKIEAKTDAS PTWEHMDMNL FIR