Gene Sare_4269 details


Gene Information

Locus tag: Sare_4269
Symbol:
ID: 5705774
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4845279
End bp: 4846304
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 64%
IMG OID: 641273688
Product: NMT1/THI5-like domain-containing protein
Protein accession: YP_001539041
Protein GI: 159039788
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
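
The length fields in this record agree with each other if the start and end positions are read as inclusive, 1-based coordinates and the final codon is a stop codon. A minimal Python sketch of that arithmetic follows, using the values from the record. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
START_BP = 4845279
END_BP = 4846304

length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1026 341
```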


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.933987
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0127409
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAGGC TGACCCGTAC GGTCGCCGCA GCCACGACGG CCGCTGCCCT GATGTTGGTC 
GGGGCATGTA GCGGCTCGGA CTCCACTGAC GACAAGGGGG GTGACAGCGG TGCGCTGGAG
CAAGTAACCT ACCTCACCTC ATTTGGGAAC TTCGGCCGTG ATTCCTACGC CTGGGTGGCG
AAGGAAAAGG GCTTCTTCCG GGACGCGGGC TTTGACGTCG ACATCAAGGC GGGGAAGGGC
ACCGGTGCCG TTATCCAGAC GGTCTCCGGA GGCAAGGCGC ATTTCGGGCC GATCGACCTC
ACCGGAGGTT TGCTCCAGTT TGGCAACGGC GAGGCAAAGG ACTTCGTCGT CGTGGCCGCG
ATCCAGCAGC GCACCATGGC CGGCATCGCC ACCGTCGAGG GCACGAACAT CACCACCCCG
AAGGATCTTG AGGGTAAGAA GATCGCGGAC GCCCCCGCCT CCGTGGTCCG CAACCTCTTC
CCCACGTACG CCAAGATGGC CGGCGTCGAC GCGAGCAAGG TGACCTGGGT CAACGGTGCG
CCGCAGGACC TGATGGGTAC CCTCGCCGCG GGCACCGTTG ACGGCATCGG GCAGTTCGTG
GTTGGCCAGC CGACCATTGA GGCGGTGGCC AAGAAGAAGG CGATCATGCT GCCGTACAGC
GAGTACATGC AGGATCTCTA CGGCAACGTG CTGATCACGT CGACAACGAT CGCCAAAGAG
CAGCCGGACA TGGTCAAGCG TTTCCGCGAC GCTCTGCTCA AGGGCTTGGA CTACGCGTTG
GCCAATCCGC AGGAGGCAGC TGAGCTGCTG AAGAAGAACG TGGACTCGAC GAACGTCGAC
GCCGCCAGGT CGGAACTGGA ACTGATGGCC GGCTACGTGC GGTCCAGCAA CAGCGGTGCC
CAGCTGGGCA CGGTGGACAG CGCCCGGGTG GCGCAGAGCA TTGCCATCCT GCAGGGCGCG
GGCGCGCTCA AGCAGACCCT CGATCCCGAC GAGATCATCG ACTTCAGTCT CACGCCGAAG
GCCTGA
 
Protein sequence
MSRLTRTVAA ATTAAALMLV GACSGSDSTD DKGGDSGALE QVTYLTSFGN FGRDSYAWVA 
KEKGFFRDAG FDVDIKAGKG TGAVIQTVSG GKAHFGPIDL TGGLLQFGNG EAKDFVVVAA
IQQRTMAGIA TVEGTNITTP KDLEGKKIAD APASVVRNLF PTYAKMAGVD ASKVTWVNGA
PQDLMGTLAA GTVDGIGQFV VGQPTIEAVA KKKAIMLPYS EYMQDLYGNV LITSTTIAKE
QPDMVKRFRD ALLKGLDYAL ANPQEAAELL KKNVDSTNVD AARSELELMA GYVRSSNSGA
QLGTVDSARV AQSIAILQGA GALKQTLDPD EIIDFSLTPK A
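
The protein sequence can be recovered from the gene sequence by codon translation. The sketch below is a minimal, standard-library-only illustration, not the tool used to build this record. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `translate` and `gc_percent` are hypothetical. The 10-base grouping and spaces in the listing are stripped before processing.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence listed above.
first_line = "ATGAGTAGGC TGACCCGTAC GGTCGCCGCA GCCACGACGG CCGCTGCCCT GATGTTGGTC"
print(translate(first_line))  # MSRLTRTVAAATTAAALMLV
print(round(gc_percent(first_line)))
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way allows the result to be compared with the protein sequence and the GC content field listed in the record.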