Gene Sare_4255 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4255 
Symbol 
ID5704387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4828304 
End bp4829674 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content68% 
IMG OID641273674 
Productnitrite transporter 
Protein accessionYP_001539027 
Protein GI159039774 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2223] Nitrate/nitrite transporter 
TIGRFAM ID[TIGR00886] nitrite extrusion protein (nitrite facilitator) 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.823889 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00256557 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGACGA CGAGCGTGCC CGCCACGACG GGGGACGAGG AGATCGACCT GCAACGCCGG 
AAGGGTCGCT GGATCGGCTA CTGGGCGCCC GAGGACGACC GCTTCTGGCG GACCGCCGGC
CGGGCCGTCG CCCGGCGCAA CCTCATCTAC TCGATCTTCG CCGAGCACAT CGGGTTCTCC
GTCTGGCTGC TCTGGAGCAT CGTGGTGGTC CGGTTGGACG ACGTCGGATG GACGCTGACG
ACCAGCCAGG CGCTCTGGCT GACTGCCGTG CCCAGCGGTG TCGGCGCGCT GCTGCGACTG
CCCTATACCT TCGCCGTGCC GATTTTCGGC GGCCGGAACT GGACCGTCAT CTCCGCGCTG
CTGCTGATCA TCCCGTGTGC CGGGCTGGCG TGGGCGGTCC AGCATCCGGA AATCGGGTTC
ATGCCGCTGC TGCTGATCGC CGCCACCGCC GGCCTCGGCG GCGGTAACTT CGCCTCCAGC
ATGGCGAACA TCTCGTTCTT CTACCCCGAG CGGGAGAAGG GGTGGGCGCT CGGGTTGAAC
GCGGCCGGCG GCAACATCGG TGTCGCCGTG GTGCAGTTCC TGGTGCCTCA GGTGATCGTG
CTCGGTGGCG GCCTGGCGTT GGCCAGGGCC GGGCTGATGT ACCTTCCGCT CGCGGTGATC
GCCGCGGTCT GCGCCTTTCT GTTCATGGAC AACCTGGTCG AGGCCAAGGC GGACGTGGGA
TCGGTGTGGT CCTCGTTGCG GCACCGGGAC ACGTGGATCA TGTCATTGCT GTACATCGGT
ACGTTTGGTT CCTTCATCGG CTACTCGGCG GCCTTTCCGA CGTTGCTCAA CGGGGTGTTC
GGCCGACCCG ACATCGCGCT GTCCTGGGCG TTCCTCGGTG CGGCAGTGGG CTCGGTCTGC
CGACCCTTCG GGGGCCGCCT CGCGGACGCC ATCGGTGGCG CCCGGGTCAC CGTGGCCAGC
TTCGTGCTGA TGACCGGCGG TGCCTACCTG GCCCTGTGGT CGGTGCGGGA ACGCTGGCTG
GGAGTCTTCT TCCTGGCGTT CATGCTGCTG TTCGTGGCCA CCGGGGTCGG CAACGGGTCG
ACGTACCGGA TGATCTCCCG GATCTTCCAG GTGCAGGGGG AGAAACTCGG CGGCTCACCG
GAGATCATGC GGGCGATGCG CCGGCAGGCA GCCGGGGCAC TCGGAATCAT CTCCGCGGTC
GGTGCCTTCG GCGGGTTCCT GGTCCCGATC TGCTACGCAT GGGCGAAGTC GGCCTACGGC
AGCATCGAGC CCGCGCTGTG GTTCTATGTC GGCTTCTTCC TGGTGCTGAC GGTGCTGACG
TGGGGGGTGT ATCTGCGACC GGGGGCGCGG CTGACCGGGG ATCGGGTGTG A
 
Protein sequence
MTTTSVPATT GDEEIDLQRR KGRWIGYWAP EDDRFWRTAG RAVARRNLIY SIFAEHIGFS 
VWLLWSIVVV RLDDVGWTLT TSQALWLTAV PSGVGALLRL PYTFAVPIFG GRNWTVISAL
LLIIPCAGLA WAVQHPEIGF MPLLLIAATA GLGGGNFASS MANISFFYPE REKGWALGLN
AAGGNIGVAV VQFLVPQVIV LGGGLALARA GLMYLPLAVI AAVCAFLFMD NLVEAKADVG
SVWSSLRHRD TWIMSLLYIG TFGSFIGYSA AFPTLLNGVF GRPDIALSWA FLGAAVGSVC
RPFGGRLADA IGGARVTVAS FVLMTGGAYL ALWSVRERWL GVFFLAFMLL FVATGVGNGS
TYRMISRIFQ VQGEKLGGSP EIMRAMRRQA AGALGIISAV GAFGGFLVPI CYAWAKSAYG
SIEPALWFYV GFFLVLTVLT WGVYLRPGAR LTGDRV