Gene Avin_23460 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_23460 
SymbolnasS 
ID7761262 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp2346704 
End bp2348026 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content67% 
IMG OID643805228 
Productnitrate/nitrite transport system substrate-binding protein ; NasS 
Protein accessionYP_002799509 
Protein GI226944436 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.143963 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGTCGT TTTCGGCCCT GCCCCAGGCA GTTGTCGAGG CGACGCCTTT TCGTCCAGCG 
CCGGAAAACA CCATGACAGA CCACCACGCA ACTTCCAGAG CGGATGCCCG GGCCTGGGTC
GCGGGCAGTG ATGCCCCCGA GAAGAGCTCG ATCAACCTGG GCTTCATGCC GCTCACCGAC
TCCGCCTCGC TGATCGTCGC CGCGACCCAG GGTTTCGCCG AACCCTATGG GCTGACCCTC
AATCTCAAGC GCCAGGTATC CTGGTCGGGC CTGCGCGACA AGTTGCTCAG CGGCGAACTG
GATGCCGCCC AGGGACTGTA CGGACTGATC TACAGCATGC AGCTCGGCAT CGGCGGCGCA
CCGGCGACCG ACATGGCGGT GCTGATGGGC CTCAACCAGA ACGGCCAGAG CATCAACCTG
TCGACCCCGC TCCGGCAAGC CGGGGTGTGC AGTGGCGAGA CACTGGTCCG GCATGTGCGC
CAGAGCAGCG CGAAGCTCAC CCTGGCCCAG ACCTTTCCGA CCGGCACCCA TGCCCTCTGG
CTCTACTACT GGCTCGCCAG CCTGGGCATC CACCCCCTCG CGGACGTGAA CACCCTGGTG
GTGCCGCCGC CGCAGATGGT CGAGCACTTG CGCGCCGGCC GCATCGACGG TTTCTGCGCC
GGAGAGCCCT GGGGCGCCCA CGCCATCGAC CAGGGCATGG GTTTCACCAT CGCTACCAGC
CAGTCGATCT GGCCGGACCA CCCGGAGAAA GTCCTCGGCT GCACCCGTGC CTTCGCCGAG
CAGTACCCCA ATACCGCCCG CGCCCTGATC ATGGCCGTGC TGGAAGCGAG CCGCTTCATC
GACGCCAGCG AAGAGAACAA GGCCGGCACC GCGCAACTGA TCAGCGCCGA CGAATACGTG
GCCGCACCGC GCCAGGTGAT CGAACCGCGC TTTCTCGGCG ACTACGAGGA CGGCAACGGC
CATGCCTGGC GCGACAGCCA TGCCCTGCGC TTTCATGGCG ACGGCGAGGT CAACCTGCCC
TATCTCTCCG ACGGTCTCTG GTTCATGACC CAGTTCCGCC GTTGGGGACT GCTGCGCGAA
GACCCGGATT ACCTGGCCAT CGCCACTCGG GTGCAGCAGC TCGAACTGTA TCGCGATGCC
GCCGGCGCTC TCGGCATGGC GCAGCCCGCC ACGGCCATGC GCAGCGCCAC CCTGCTGGAC
GGCAGGCGCT GGGACGGCTC CGACCCCGCG GCCTATGCCC GCAGCTTCGA CCTCCACGCC
CTGAGCGAGC TGCCCTCCTC GGCCGATGAC CAAGGACCAT CCCCATGCTG CGCATCCTCC
TGA
 
Protein sequence
MASFSALPQA VVEATPFRPA PENTMTDHHA TSRADARAWV AGSDAPEKSS INLGFMPLTD 
SASLIVAATQ GFAEPYGLTL NLKRQVSWSG LRDKLLSGEL DAAQGLYGLI YSMQLGIGGA
PATDMAVLMG LNQNGQSINL STPLRQAGVC SGETLVRHVR QSSAKLTLAQ TFPTGTHALW
LYYWLASLGI HPLADVNTLV VPPPQMVEHL RAGRIDGFCA GEPWGAHAID QGMGFTIATS
QSIWPDHPEK VLGCTRAFAE QYPNTARALI MAVLEASRFI DASEENKAGT AQLISADEYV
AAPRQVIEPR FLGDYEDGNG HAWRDSHALR FHGDGEVNLP YLSDGLWFMT QFRRWGLLRE
DPDYLAIATR VQQLELYRDA AGALGMAQPA TAMRSATLLD GRRWDGSDPA AYARSFDLHA
LSELPSSADD QGPSPCCASS