Gene Sare_0191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0191 
Symbol 
ID5706327 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp205219 
End bp206637 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content67% 
IMG OID641269717 
Productmajor facilitator transporter 
Protein accessionYP_001535117 
Protein GI159035864 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2223] Nitrate/nitrite transporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000226974 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGCCCGTCA AACCCACTCC CGACGACCCG ACCCCGGACG AAACGGCTGC CGGCCAGAAC 
AGCCGCGACG ATCGTGGGCG CAACCGGGTC CTCACCCTGT CCACCATCGG CTTCACCGTC
ATGTTCGCGG TGTGGCTCAT GTTCGGCATC CTCGGCAAGC CGATCAGCGA CGAGTTCAAC
CTCTCCGAGG TGCAACTGTC CTGGATCATC GCCGCCGCCG TGCTCAACGG CTCGCTCTGG
CGGCTCCCCG CCGGCATCGT GGCGGACCGC ATCGGCGGAC GCCGAGTGAT GACGGCGATG
CTTCTCCTGA CCGCGCTGGC CTCGTTCCTC GTGTCCCGCG CCAATTCCTA CCCGATGCTG
CTGGCGCTGG CGTTTCTGGT CGGCTTCGCC GGCAACTCGT TCACCGCTGG TATCGCCTGG
AACTCGGCGT GGCAGCCGCG GGAGAAGCAG GGTTTCGCGC TCGGCCTGTT CGGTGCGGGC
AACGTCGGCG CATCGGTAAC CAAGTTCATC GGCCCGCCGC TGATCGCGGG AACCGCGGGC
GCCACCTACC TCGGCGTTAT CGAGGGTGGT TGGCGCCTCG TCCCCGTCGT CTACGCGGTG
TTGCTGCTCG TCCTCGCGGC GGCTACGTGG TTCCTCACCC CCCGCCGCGA CCGCGTGCCA
AGCCACGGCA CCCCGCTGCG CGAACAGCTC GAACCGCTCA AGCAGATACG AGTGTGGCGA
TTCAGCCTGT ACTACGTGGC GGTGTTCGGG GCCTATGTGG CGCTCGCCGC GTGGCTGCCG
ACCTACTACA TGAACAACTA CGACGTGTCG CTGCAGACCG CGGCCTATCT GACCGCCCTG
TACATCTTCC CCGCCTCGCT GCTGCGACCG GTCGGCGGGT CGTTGTCCGA CCGTCTGGGT
GCCCGCCGCG TCATGTACTG GACATTCGGC CTCATGCTGC TCAGCACGGG CATCCTGATG
ATGCCCCCGG GCCACATCGT CGTCGACCAC CCCGATGGCA CGCAGACCAG TCACCTCGCC
TACCAGCTCG GCATCGTGCC CTTCACTGTT CTGGTCGTCC TGCTCGGCTG CGCCATGGGC
GTCGGCAAGG CCGCGGTGTA CAAGCACATC CCCGAGTACT TCCCGCGCCA GGTCGGGGCC
GTGGGCGGTC TGGTCGGCAT GCTCGGCGGC CTCGGCGGGT TCTTCCTGCC CCCGATGTTC
GCCTACACCA AGGCGTGGAC GGGCCTCCCC TCCAGCACCT TCCTGGTCCT GTTCATACTC
ACCGCTATTT GCGCTGTTTG GATGCACCTG ACCGTGGTTC GCATGTTGCA CGGTGAATCG
CCCCAGCTTG CCGACCATTT CGAGAAGCCA GAACCCGTTG ACCAGCCGAC CGCTCCGGCT
ACGGCGGCGA CCCGTGTGCC TGAGGAGGCC CGCGAATGA
 
Protein sequence
MPVKPTPDDP TPDETAAGQN SRDDRGRNRV LTLSTIGFTV MFAVWLMFGI LGKPISDEFN 
LSEVQLSWII AAAVLNGSLW RLPAGIVADR IGGRRVMTAM LLLTALASFL VSRANSYPML
LALAFLVGFA GNSFTAGIAW NSAWQPREKQ GFALGLFGAG NVGASVTKFI GPPLIAGTAG
ATYLGVIEGG WRLVPVVYAV LLLVLAAATW FLTPRRDRVP SHGTPLREQL EPLKQIRVWR
FSLYYVAVFG AYVALAAWLP TYYMNNYDVS LQTAAYLTAL YIFPASLLRP VGGSLSDRLG
ARRVMYWTFG LMLLSTGILM MPPGHIVVDH PDGTQTSHLA YQLGIVPFTV LVVLLGCAMG
VGKAAVYKHI PEYFPRQVGA VGGLVGMLGG LGGFFLPPMF AYTKAWTGLP SSTFLVLFIL
TAICAVWMHL TVVRMLHGES PQLADHFEKP EPVDQPTAPA TAATRVPEEA RE