Gene Sare_4179 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4179 
Symbol 
ID5703967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4747069 
End bp4748163 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content69% 
IMG OID641273606 
Productoligopeptide/dipeptide ABC transporter, ATPase subunit 
Protein accessionYP_001538959 
Protein GI159039706 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00200112 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACCCTCA TGCGCGGGAG CAGGCACACC CGTGGCAGCG AGCGAAAGGC CGACACCGTG 
TCCCAGTCCA CCGTCCAACC CACGCCCGCC TCCCCCACCC CGGAGGGCGG CCACCTACTC
GAGGTACGCG ACCTGCACGT CGAGTTCCGC ACCGGGGAGG GCGTGGCCAA GGTGATCAAC
GGTGTCTCCT ACCACTTGGA CGCCGGGGAG ACCCTGGCCG TGCTCGGCGA ATCCGGCTCC
GGCAAATCGG TCACCGCCCA GGCGATCATG GGTATCCTCG ACACCCCGCC CGCCGTGATC
CGCGCCGGAC AGATCCGCTA CCAGGGACGA GACCTGCTCG CCCAGTCAGA GGAGCAACGC
CGGCAGGTAC GCGGCACCGA GATCGCGATG ATCTTCCAGG ACGCCCTGTC CGCGCTGAAC
CCGACGTTCC CGGTCGGCTG GCAGATCGGT GAGACGCTCC GCCAACGCGC CGGGATGTCC
CGCGGCGACG CCCGTCGCCG CGCGATCGAA CTGATGGACC TGGTGAAGAT CCCCGCCGCG
GCCAACCGGC TCGGCGACTA CCCACACCAG TTCTCCGGCG GGATGCGGCA ACGCGTCATG
ATCGCCATGG CGCTGGCACT GAACCCGAAG GTGCTCATCG CCGACGAGCC GACCACCGCG
CTGGACGTGA CCGTGCAGGC CCAGATCATG GACCTGCTGG CCGACCTACG CCGAGACCTG
CGCATGGCGA TGATCCTGAT CACCCACGAC CTCGGCGTGG TGGCCGGCGT CGCCGACCGC
ATCGCCGTCA TGTACGCCGG CCGGATCGTC GAACACGCCG ACGTCCGGTC GCTGTACCGG
TCACCCGCAC ACCCGTACAC CAAGGGGCTG TTGGAGTCGA TCCCACGGCT GGACGTCCAC
GGCCAACAGC TGTCGACTAT CCGAGGCCTA CCGCCGAACC TGATGCGGAT TCCCTCCGGC
TGCCCCTTCC ACCCCCGGTG CCCGTACGTC CAGCAGGTCT GCGTGGACGT CGTACCGCAC
GACCTGGTCC TCGGCGACGG CCGAACCAGC GCGTGCCACT TCGCGCAGGA GGTCCGCGAT
GACCGCGCCC GCTAG
 
Protein sequence
MTLMRGSRHT RGSERKADTV SQSTVQPTPA SPTPEGGHLL EVRDLHVEFR TGEGVAKVIN 
GVSYHLDAGE TLAVLGESGS GKSVTAQAIM GILDTPPAVI RAGQIRYQGR DLLAQSEEQR
RQVRGTEIAM IFQDALSALN PTFPVGWQIG ETLRQRAGMS RGDARRRAIE LMDLVKIPAA
ANRLGDYPHQ FSGGMRQRVM IAMALALNPK VLIADEPTTA LDVTVQAQIM DLLADLRRDL
RMAMILITHD LGVVAGVADR IAVMYAGRIV EHADVRSLYR SPAHPYTKGL LESIPRLDVH
GQQLSTIRGL PPNLMRIPSG CPFHPRCPYV QQVCVDVVPH DLVLGDGRTS ACHFAQEVRD
DRAR