Gene Sare_4781 details

Gene Information

Locus tag: Sare_4781
Symbol:
ID: 5704448
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5412071
End bp: 5414182
Gene Length: 2112 bp
Protein Length: 703 aa
Translation table: 11
GC content: 70%
IMG OID: 641274179
Product: oligopeptide/dipeptide ABC transporter, ATPase subunit
Protein accession: YP_001539525
Protein GI: 159040272
COG category: [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component; [COG4608] ABC-type oligopeptide transport system, ATPase component
TIGRFAM ID: [TIGR02323] phosphonate C-P lyase system protein PhnK
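
The coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the record above.
start_bp = 5412071
end_bp = 5414182

# Assumption: the CDS ends with exactly one stop codon, which does not
# contribute a residue to the protein.
stop_codons = 1

# Assumption: coordinates are 1-based and inclusive, so both end
# positions count toward the length.
gene_length = end_bp - start_bp + 1

# Each codon is three bases long.
codons = gene_length // 3
protein_length = codons - stop_codons

print(gene_length)     # 2112
print(protein_length)  # 703
```

Under these assumptions the computed values agree with the listed gene length (2112 bp) and protein length (703 aa).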


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000215564
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCGAAT CGATCGGTCC AACCACCCGG CACGCCTCGT CGGCCGGCAG CCGACCCCTG 
CTGGAACTGC GCGACCTCGA CACCGACATC GCGCTACGCC GCGGCACGGT ACACGCCCTC
GACGGCGTCA GTCTCGAGGT CGCGCCCGGG CAGACCCTCG GCATCGTCGG CGAATCCGGC
AGCGGCAAGA CGATGACCGC CCTGTCGATC ATGGGTCTAC TTCCGTCCGG CGGTCGGGTC
GCCGGTGGGC AGATTTTGTT CGAGGGACGG GACCTGCGAT CGTTGCCACC CGACGAGGTG
CGCCGAATCC GAGGTGTTCG GATGGGCATG GTCTTCCAGG ACCCGCTCAC CTCGCTCAAC
CCCACCATGC GCATCGGCGC GCAGGTGGCC GAACCGTTGC GCGTGCACGA GCGGGTCGGC
CGGGCGGAGG CCCGGGAGCG GGCCGTCGAG ATTCTGCGCC GGGTCGGGAT GCCGCGTCCG
GAGCGGATCG TCGACAGCTA TCCGCATGAG CTGTCCGGCG GAATGCGGCA ACGGGTCGCC
ATCGCGATGG CACTGGTCTG CTCACCAAGC CTGCTCATCG CCGACGAACC GACCACGGCC
CTCGACGTCA CCACCCAGCG GCAGATCCTC GAACTCATCG ACGATCTGCG AGAAGAGTTC
GGGATGGCCG TCATCCTGGT CACCCACGAT CTCGGGGTAA TCGCCGGTCG GGCGGACCGG
GTCGCCGTCA TGTACGCCGG ACGGGTCGTG GAGACCGCGG CCACCGAGCA GTTGTTCCAC
GCGCCCCGAC ACCGCTACAC CGAGGCCCTG ATGCAGGCGC TACCCGAGTC GGCGATCCGG
GAGAACGGCG GACACGACCG GCTGAACAGC ATCCCAGGGC TGCCGCCGGA CCTGTCCGGC
CCCCTGACGG GCTGCCGGTT CGCGCCGCGG TGTTCGTATG TCGGTGACGA CTGCCGGACC
ACCGACCCGT CCCTGATCCC GGGCGCGCAC CAGCACGCGT GCCTGCACCC GGTACCCGCG
CCCACCGGGG CGGTCACCGG GCCGGTCCCC GCGCGGACCG AACCGGCGGC GACGCCGGTC
ACCGCGGCAC CCGAACTCGC GGCGGCGCCG GTCCTGTCGG TGCGGAACCT GGTCAAGAAC
TACCCGGCGC ACGGAGGTGG GCTACTGCGC CGCAGCGCCG GGCAGGTAAG CGCGGTGGCC
GACGTCTCCT TCGAGGTCAG GCCCGGGGAG ACCTTCGGGC TCGTCGGCGA GTCCGGCTGC
GGTAAGTCCA CTGTGGGACG ACTCTCCGTC GGCCTGGAAC AGCCCTCCGC CGGACAGATC
GTGTTGGACG GTACGGATCT CACCGACCTG ACCGGTCGGG AACGACGCCG AATGCACCGG
CAGGTCCAAC TCATGTTCCA GGACAGCTAC GCCGCGATGA ACCCTCGGAT GCGCGTCGAC
GCCATCCTCG CCGAACCGCT CGAGATTCAG AAGGTGGGTG ACGGTGCGGC ACGGCAAGCG
CGGATCGCCA CCCTCCTCGA CCAGGTGGGG TTGTCCCGGC GGGCGCTGGA ACGCTACCCC
CACGAGTTCT CCGGCGGTCA GTTGCAGCGG ATCGGGCTGG CCCGGTCGCT GGCGCTGCGC
CCCCGCCTGA TCGTCGGCGA CGAACCGGTG AGCGCACTGG ACGTGTCGAT CCAGGCCCAG
GTACTCAACC TGATGCGGGA CCTGCAGCGC GAGCTCGGTC TGGCGTACAT CTTCATCAGC
CATGACCTGT CCGTTGTCGA CTACATGGCG GATCGGATCG GGGTCATGTA CCTCGGCAAG
CTCGTCGAGA TCGGGCCCGC CCGGGACGTG GTGCGAGCCG CCCGGCATCC GTACACGCAG
GCCCTGGTCG ACGCGGTCCC GTCCATCACG CCGAGCTCGG CAGCCGCTGG CGACGGCATG
ACCATCCGGG GCGAGCTTCC CAGTGCCCTC GACCCACCCA CCGGATGCCG ATTCCGTACC
CGCTGCCCCC GCGCGGCGCA GATCTGCACG ACGGAGCCGC CGCTGGTCGG TGGCCTGCAC
CAGGTGGCCT GCCACCTGCC GCTACGCGCG GAGCCAAGCG GGACGGCCAC GCAGACCCAG
GCCGCTGTAT GA
 
Protein sequence
MSESIGPTTR HASSAGSRPL LELRDLDTDI ALRRGTVHAL DGVSLEVAPG QTLGIVGESG 
SGKTMTALSI MGLLPSGGRV AGGQILFEGR DLRSLPPDEV RRIRGVRMGM VFQDPLTSLN
PTMRIGAQVA EPLRVHERVG RAEARERAVE ILRRVGMPRP ERIVDSYPHE LSGGMRQRVA
IAMALVCSPS LLIADEPTTA LDVTTQRQIL ELIDDLREEF GMAVILVTHD LGVIAGRADR
VAVMYAGRVV ETAATEQLFH APRHRYTEAL MQALPESAIR ENGGHDRLNS IPGLPPDLSG
PLTGCRFAPR CSYVGDDCRT TDPSLIPGAH QHACLHPVPA PTGAVTGPVP ARTEPAATPV
TAAPELAAAP VLSVRNLVKN YPAHGGGLLR RSAGQVSAVA DVSFEVRPGE TFGLVGESGC
GKSTVGRLSV GLEQPSAGQI VLDGTDLTDL TGRERRRMHR QVQLMFQDSY AAMNPRMRVD
AILAEPLEIQ KVGDGAARQA RIATLLDQVG LSRRALERYP HEFSGGQLQR IGLARSLALR
PRLIVGDEPV SALDVSIQAQ VLNLMRDLQR ELGLAYIFIS HDLSVVDYMA DRIGVMYLGK
LVEIGPARDV VRAARHPYTQ ALVDAVPSIT PSSAAAGDGM TIRGELPSAL DPPTGCRFRT
RCPRAAQICT TEPPLVGGLH QVACHLPLRA EPSGTATQTQ AAV