Gene Sare_4210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4210 
Symbol 
ID5707948 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4778659 
End bp4779696 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content67% 
IMG OID641273629 
Productoligopeptide/dipeptide ABC transporter, ATPase subunit 
Protein accessionYP_001538982 
Protein GI159039729 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.384565 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00766091 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCACTG ATGTGAACGT CCAACTGGAC GCCCTGCCCG GCCTCGACTC GGACGCTCTG 
CCGCTCGAGG TCAAGGATCT GCAGGTCGAG TTCCGCACCC GCAACGGCAT CGCCCGCGCG
GTCAACGGCG TCAGCTTCAA CCTGCGAGCC GGAGAGACCC GAGCGATCCT CGGTGAGTCC
GGCTGCGGCA AGAGCGTCAC CGCCCAGGCG ATCATGGGAA TCCTCGACAG CCCACCCGGA
TTCGTCACCG GCGGGGAGAT CCGCTACCGC GGCGTCGACC TGCTCAAGCT GCCCGAGGCG
CAGCGGCGGA AGGTCCGCGC CAACCGGATC GCGATGATCT TCCAGGACGC CCTCTCCGCG
CTGAACCCGG TCTTCACGGT CGGTTTCCAG CTTGGTGAGC TGTTCCGCAA GCACCGGGGC
ATGTCCCGGT CGGACAGTAA GGCGCGCGCC GTCGAACTGC TCGACCTGGT CAAGATCCCG
GCAGCGAAGC AGCGGGTGAA CGAATACCCG CACCAGTTCT CCGGCGGTAT GCGCCAGCGC
GTCATGATCG CTATGGCGTT GGCGCTCGAC CCGGAGGTGT TGATCGCCGA CGAGCCGACC
ACCGCCCTGG ACGTCACTGT GCAGGCCCAG ATCATGGCGC TGCTCGCCGA ACTACAGCGG
GAACGGAACA TGGGCCTGCT GCTGATCACG CACGACATGG GCGTGGTCGC CGACGTGGCG
GACCAGATCT CGGTGATGTA CGCGGGCCGG GTCATTGAGG AAGCCCCGGT CCGGGACATC
TACGAGAGCC CGGCCCACCC GTACACCAAG GGTCTGCTGG AGTCGATTCC ACGTCTGGAC
CTCAAGGGTC AGGAACTCTC CGCTATCAAG GGGCTGCCGC CGCTGCTGAC GGACATCCCC
AAGGGGTGCG CGTTCAACCC GCGGTGCCGG TATGCGCAGG ACGCCTGCCG TCAGGACCCG
GTGCCGCCGC TGTATCAGGT GGCACCGGTC CGAAGCGCCG CCTGCCACTT CTGGAAGGAG
GTCAAGGCCG ATGCCTGA
 
Protein sequence
MSTDVNVQLD ALPGLDSDAL PLEVKDLQVE FRTRNGIARA VNGVSFNLRA GETRAILGES 
GCGKSVTAQA IMGILDSPPG FVTGGEIRYR GVDLLKLPEA QRRKVRANRI AMIFQDALSA
LNPVFTVGFQ LGELFRKHRG MSRSDSKARA VELLDLVKIP AAKQRVNEYP HQFSGGMRQR
VMIAMALALD PEVLIADEPT TALDVTVQAQ IMALLAELQR ERNMGLLLIT HDMGVVADVA
DQISVMYAGR VIEEAPVRDI YESPAHPYTK GLLESIPRLD LKGQELSAIK GLPPLLTDIP
KGCAFNPRCR YAQDACRQDP VPPLYQVAPV RSAACHFWKE VKADA