Gene Sare_4598 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4598 
Symbol 
ID5706619 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5216949 
End bp5218433 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content69% 
IMG OID641274002 
ProductABC transporter related 
Protein accessionYP_001539349 
Protein GI159040096 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG1122] ABC-type cobalt transport system, ATPase component
[COG1126] ABC-type polar amino acid transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0433816 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCGCA TCGACCACGC TACCTGGACC TACCCGCACG CCGAGCAGCC GAGCCTGCGA 
GACCTCACCC TGCGCGTCAA CCCGGGTGAG TTCGTGATCC TCTGCGGCGC GTCAGGATCC
GGCAAATCCA CCGCACTCCG GCTCATGAAC GGCCTCATTC CGCACTTCCA CGAGGACGGC
GTCCTCACCG GCACCGTCAC TGTCGGTGGG CTGGTCACGA CCAATGCCGA GCTGGACGCT
ATGGGCCTCG TCACCGGCAC CGTGCTGCAG CATCCGAAGC GCCAGTTCTT CACCGACACC
GCTCCGGAGG AGGTCGCCTT CGCGATGGAG AACTTCGGCT TCCCCCCGGA AGAGATCCGG
CGCCGCGTCG TGGAGACGGT CGAGGAGCTC GCCACGGGGG TGCCGGTCGA ACAACGCCTG
CGGGATCTCT CCGGCGGTCA GCAGCAACAG GTCGCGATCG CCGCTGCGAT CGCCCACCGC
CCCAGCGTCC TCCTGCTGGA TGAGCCCAGC TCGAACCTCT CGTCCGACGC GGTGCAGCGC
CTCACCGCCA CGCTCGCCAG CCTCAAGGCG CAAGGAGTGA CGATCGTGAT CGCCGAACAC
CGACTGCGCT ACCTGGAAGA CCTCGTCGAC CGGGTCATCG TGATGCGCGA CGGCGCGATT
GACGTCGAGT GGCCCGCGGC GCAACTCCGT GCCGTGCCGG ATGACGAGCT CGCCCGCGAG
GGACTCCGCG GGGTCGTGAG CACGGTCGAT CTGCCGGCCC TGCCGGCATC AGGCGCCAGC
ATCGTCGCAG GAGCCGACGC ATCGGAGATC CCGGGCGCCG CGCTCGAACT GGAGGCGATC
CGCTGCCGCC TCGGCGGGCG CATCGTGCTC GACATCGACC GCGTCGCCTT CGCGGACGGC
TCGGTCACCG CGGTCCGCGG AGTCAACGGT GCAGGCAAGT CCACTTTCGC TCGAATCATG
ACCGGCCTGC AACGCAGCAC CGGCACCGTC TTCCTCGATG GGAAGGCGCT GAACCCGCGG
GCACGCCAGC GCGCGAGCGC GATCGTCATG CAGGACGTCC AGCGGCAGCT TTTCACCGAC
AGCGTCAAAG CCGAGATCCA CCTCGCCGGC ACCGACACCC CCGAGGCTCC TGATACGGAT
ACGGTGCTCG ATGCCCTCGA CCTCGCGCAC CTCGCCGACC GGCATCCGCT GTCGCTCTCC
GGTGGCCAAC AGCAGCGCCT CGTCGTGGCT GCCGTCCGGG TTGCTGGCCG ACGCATCGTC
GTGTTCGACG AGCCCAGCTC CGGCGTGGAC CGCCGCCACC TGCGGTCCAT CGCCGACCAG
ATCCGCCGCC TCGCCGCCGA CGGCGCCATC GTCCTACTCA TCAGCCATGA CGACGACCTG
CTCGCGCTCG CCGCAGACCG GCAACTCACT CTGGCCCCAC CGCTGAGCTC GTCGCGGAAC
CGGCATGGCG CTCACGGAGA ACCGACCGTT GAGGAAACCC GATGA
 
Protein sequence
MIRIDHATWT YPHAEQPSLR DLTLRVNPGE FVILCGASGS GKSTALRLMN GLIPHFHEDG 
VLTGTVTVGG LVTTNAELDA MGLVTGTVLQ HPKRQFFTDT APEEVAFAME NFGFPPEEIR
RRVVETVEEL ATGVPVEQRL RDLSGGQQQQ VAIAAAIAHR PSVLLLDEPS SNLSSDAVQR
LTATLASLKA QGVTIVIAEH RLRYLEDLVD RVIVMRDGAI DVEWPAAQLR AVPDDELARE
GLRGVVSTVD LPALPASGAS IVAGADASEI PGAALELEAI RCRLGGRIVL DIDRVAFADG
SVTAVRGVNG AGKSTFARIM TGLQRSTGTV FLDGKALNPR ARQRASAIVM QDVQRQLFTD
SVKAEIHLAG TDTPEAPDTD TVLDALDLAH LADRHPLSLS GGQQQRLVVA AVRVAGRRIV
VFDEPSSGVD RRHLRSIADQ IRRLAADGAI VLLISHDDDL LALAADRQLT LAPPLSSSRN
RHGAHGEPTV EETR