Gene Rcas_2420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2420 
Symbol 
ID5539901 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3113063 
End bp3114172 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content57% 
IMG OID640894550 
Productoligopeptide/dipeptide ABC transporter, ATPase subunit 
Protein accessionYP_001432518 
Protein GI156742389 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4608] ABC-type oligopeptide transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATACCG CAAAGAAAAA TGGCACATCC GACGAGATTT TGCTCGAAGT TCGTGGGCTA 
AAGAAGCATT TTCCGATTCA GAGCGGTTTC CTCCGGCGCG TGACCGGGTA TGTCAAAGCC
GTCGATGGTA TTGATTTCTA TATCAAGAAG GGTGAGACCC TCGGTCTCGT CGGTGAGTCA
GGGTGCGGCA AGAGTACGAC CGGACGCACC ATCCTTCGCC TCCTCGATCC GACCGCAGGC
GAGATTATTT TCGATGATCC GAATATTGGC AAGGTAGACC TGGCAAAACT CAATCGCGCG
CAACTCACGC GGGTCCGCCC GAATATGCAG ATTATCTTCC AGGACCCCTT CTCGTCACTG
AATCCGCGTC TGACGGTCGG GCAGATCGTT GGTGAACCGC TTGAAATCCA GAAGGTTGCT
TCCGGTCAGG CGCTCAAGGA TCGCGTCGCT GAGTTGCTCC AGGAAGTCGG TATCCGCCCT
GAGAATATGA CGCGCTACCC GCACGCTTTC TCCGGTGGGC AACGCCAACG TATCGGTATT
GCGCGCGCCC TGGCGCTCAA CCCAAAACTG ATCGTCTGCG ATGAGCCGGT GTCGGCGCTC
GATGTGTCGA TTCAGGCGCA GGTGCTCAAT CTTCTCGAAG ATCTTCAGGA GAAGTACGAC
CTGACCTACC TCTTTGTGGC GCACGATCTG AGCGTTGTTG AACATATTTC CGACCGGGTG
GCGGTGATGT ACGTGGGCTA TATCGTCGAA ATGGCAAGCA CCGAAGAACT CTATTACCAC
CCCAAACATC CGTACACCGA GGCGCTCCTT GCTGCTATTC CGAAACCCGA TCCGCGCAAA
CGCACGCGCC CGATCAAACT CCCTGGCGAC GTGCCCAGCC CGGCGAATCC TCCTTCGGGG
TGCTATTTCC ATCCGCGCTG CCGGTATGCT GAGGAGATCT GTAAGGTCGA ACGTCCACCG
CTGCGCGATA TTGGTGGTGA GCACTGGGTT GCCTGTCATT TCGCTGAGCA GTTGCAGTTG
CAGGGTGTGA CGCGCCTGAA CGAAATCCCG CTCATTGAGC TTCCCAAGCG CCAGGCGTCT
GTGCCGGCGA CAACGACAGC AACGACGTAG
 
Protein sequence
MDTAKKNGTS DEILLEVRGL KKHFPIQSGF LRRVTGYVKA VDGIDFYIKK GETLGLVGES 
GCGKSTTGRT ILRLLDPTAG EIIFDDPNIG KVDLAKLNRA QLTRVRPNMQ IIFQDPFSSL
NPRLTVGQIV GEPLEIQKVA SGQALKDRVA ELLQEVGIRP ENMTRYPHAF SGGQRQRIGI
ARALALNPKL IVCDEPVSAL DVSIQAQVLN LLEDLQEKYD LTYLFVAHDL SVVEHISDRV
AVMYVGYIVE MASTEELYYH PKHPYTEALL AAIPKPDPRK RTRPIKLPGD VPSPANPPSG
CYFHPRCRYA EEICKVERPP LRDIGGEHWV ACHFAEQLQL QGVTRLNEIP LIELPKRQAS
VPATTTATT