Gene Rcas_1058 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1058 
Symbol 
ID5538524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1373498 
End bp1375459 
Gene Length1962 bp 
Protein Length653 aa 
Translation table11 
GC content61% 
IMG OID640893194 
ProductABC-type dipeptide transport system periplasmic component-like protein 
Protein accessionYP_001431177 
Protein GI156741048 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATTC TCGACAGGAA ACTTGCGCTG GCGATGCTGC TGGCGTTGAT AGTTCCAATT 
CTGGCAGCCT GTGGCGGCGG CGCGGCACAG GCGCCGGTCC GCGAAACGGT CGTGGTGACC
GCCGAGCCAA TCCGCGAAAC CGTGGTGGTC CGCGAGACGG TGGTGGCGGA AGCTCCTACG
GCGGCGCCAG CAGGGGCATT CACCACACCG CACCCGATTC TTGGCGACGT GCGCGTGCGT
CAGGCGATTG CCTACTGCAC CAACCGACCG GAACTGATCC AGTCGGTCTA TAGCTACCTG
ACGCCGGAGC AGCAGCAGGG GCTGCTGATG GACACCAACC TGCCCAAGAC GCACTGGGCG
GCGGCGAGCG AGGCGGACGG CATCGTGACC TATCCGTTTG ACCCGGAGAA AGGCAAGGCG
CTGCTCGATG AGGCGGGCTG GAAATTGGCA GAGGGCGCGA CTGTGCGCTC GAAAGATGGC
GTGCCGCTGT CGCTCGAGTT CACGACGACC AACGCACAGT TCCGTATCAC CTGGGCGACG
GTGCTGGAAA AACAGTTGCT GACCAACTGC GGCATCCAGA TCATCCGCAA GCACGCCCCG
GCATCCTGGT GGTTTGGTGG CGCCTCCGGT CTGCGTCGGC GCGATTTTGA GCTTGGCGCG
TTCGCATGGG TCGGCGAAGC GGATCCGGGC GGCCGCACGC TCTATGCCTG CGACCAGATT
CCGCTGCCAG AAAACAACTG GAATGGCCAG AACTACATGG GTTGGTGCAA TGAGACCGCC
AGCAAGGCGA TCACTGCAGC CAACAATACG CTGAACCGCG AGGAGCGCAT CAAGAACTAC
AAGGCGTTCC AGGTTGAGTT CACGAAGGAT ATGGTCAGCC TGCCGTTGTT CCAGCGCCTG
GAGGCGTATG CCTGGAATAA GGCGCTGAAG GGGTTGAAGC CTGACCCGAC CGAGTATATC
ACGGCCAACG CCTACGAGTG GGAGCGCAGC GATAATGGCG ACACCATTAT TCTGGGTTTC
ACCCAGGAGC CGGCTTCGAT GTTTACGCTG GTCGAGAGCG CGGCGGTGCA GCGCCAGGCG
GCGCAACTCG TCGAGGGCGT GTTGACGACC CAGTATAGCT ACGACTTCCA GGCGGTGCTG
CAGGATGGTC TCTCCACCAT TGAGAGCGGC AAGGCGAAGA ATGAGGTCGT TGAGGTCAAA
GAAGGCGATA AGGTCTGGGA TGTCACCGGC GCCGCAGTCG AACTCAAGCC GGGAGTGGAA
GTCGTCAACT CCGATGGCGA GACGGTCAAG TATGAGAGCG GCACGGTCAA GATGAACCAG
CTCACCGTCA CCTACGACCT CATCAAGGGG ATCAAGTGGT CCGACGGCGA GCCGCTGAAG
AAGGCGGACC TGGAACTGGC GGTGAAGATT GTCTGCGATC CTGAGTCGGG CAACGTCAGT
CTTACCTTCT GCGAGTCGCA CGACAATGCC AACGGTGTCA CCTTCAACAG CGACACCAGC
TACACGATTA AGTTCCTACC CGGTGTCCAG TGGCCAATCT ACTTCACGGC GCCGTATGGC
GGCTACCCCT CGCATGTGAC CATTTCGGAC GGGCGGAAGC TGGCGGATGT GCCGGCCAAG
GAGTGGGCGA CGCTGCCGGA GGTTGCTGAG TTGCCGCTGG GGTATGGTCC CTACATTCTG
AAGGAGTGGA ACAAGGGCCA GAATATGAAG TTCGAGGCGA ACCCGAACTT CGTCCTCGGC
GCTCCGAAGG TCAAGAATAT CGTGATCCAG TTCTATGCCG ACACCAACGC TGCCGTGGCG
GCGCTGCTGA CCGGTGAGGT CGATATTCTG GAGAAGGCGA CGCTCGGCGC CGGTCCTGAG
GTGGAGACGG TGCTCAAGGC GGCGGCGGAG GGCAAGATTG AAGCCAAGAC CGATGCCAGC
CCAACCTGGG AGCACATGGA TATGAACCTG TTTGTCCGGT AG
 
Protein sequence
MKILDRKLAL AMLLALIVPI LAACGGGAAQ APVRETVVVT AEPIRETVVV RETVVAEAPT 
AAPAGAFTTP HPILGDVRVR QAIAYCTNRP ELIQSVYSYL TPEQQQGLLM DTNLPKTHWA
AASEADGIVT YPFDPEKGKA LLDEAGWKLA EGATVRSKDG VPLSLEFTTT NAQFRITWAT
VLEKQLLTNC GIQIIRKHAP ASWWFGGASG LRRRDFELGA FAWVGEADPG GRTLYACDQI
PLPENNWNGQ NYMGWCNETA SKAITAANNT LNREERIKNY KAFQVEFTKD MVSLPLFQRL
EAYAWNKALK GLKPDPTEYI TANAYEWERS DNGDTIILGF TQEPASMFTL VESAAVQRQA
AQLVEGVLTT QYSYDFQAVL QDGLSTIESG KAKNEVVEVK EGDKVWDVTG AAVELKPGVE
VVNSDGETVK YESGTVKMNQ LTVTYDLIKG IKWSDGEPLK KADLELAVKI VCDPESGNVS
LTFCESHDNA NGVTFNSDTS YTIKFLPGVQ WPIYFTAPYG GYPSHVTISD GRKLADVPAK
EWATLPEVAE LPLGYGPYIL KEWNKGQNMK FEANPNFVLG APKVKNIVIQ FYADTNAAVA
ALLTGEVDIL EKATLGAGPE VETVLKAAAE GKIEAKTDAS PTWEHMDMNL FVR