Gene Saro_2972 details

Gene Information

Locus tag: Saro_2972
Symbol:
ID: 3917407
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 3192952
End bp: 3194160
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 66%
IMG OID: 640445750
Product: nitrate transporter component, nrtA
Protein accession: YP_498241
Protein GI: 87200984
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
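
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes (this is not stated in the record) that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon that adds no amino acid. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3192952
end_bp = 3194160

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1209 402
```

Under these assumptions the result matches the listed Gene Length (1209 bp) and Protein Length (402 aa).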


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCGCGG AACTCGCGAT AGGCTTTCTG CCGCTGGTCG ATGCGTGCCT GCCGATCCTG 
GCGCGCGAAC ATGGCTTTGC CGAGGCGGAA GGAATCTCGC TGAAGCTCGT TCGCGACATG
AGCTGGGCGA CGGTGCTGGA CCGCCTGCTC TATGGCCACA GCGATGCCGC GCACCTCGTC
GCGCCGCTTG CGATTGCGAC GACGCTGGGG CGGGGACGTC CCGCGGTGCC GCTTTCGGTG
CCATTCGTGC TCGGGCTGAA CGGCAATGCG ATCACCATGC GCACCGCACT GGCCGAGAGG
GTTTGCCCGG CGGGAAGGCT GGGCGATCCG CGTGCCGTGG GCGCGGCCTT GAAGGACGTG
GCCGCCGAGC GCAAGGCGGC GGGCAATCCG CTGCGCTTCG GCGTGGTGCA TCGCTATTCC
AGCCACAACT ACATGCTGCG CTACTGGCTT GCCGCATGCG GCATCCGACC GGACAAGGAC
GTGGAGATCG CGACGGTGGC GCCGCCGTTC TGTTCCGACG CGCTGGAAGC CGGCGAGGTG
GACGTCATTT GCGTCGGGGA GCCGTGGAAC TCGGTGGCGG TAGAGCGGGG CGCGGGCCGG
ATCGTGCTCG TGACGGCGCA GATCTGGCGA CGCGGCGTGG AGAAGGTGCT GGCTTTGCGC
GAGCCCGTCC TCAAGGAGCG GCGCGGCGAG GTCGAAGCCT TGCTGCGGGC GCTGGTGGCA
GCGGCGCGGC ACTTCGTTTC CCCTGAAAAC TGGGATTCGA ACGCAGCGAT TCTGGCGCGA
CCTGAGTACC TTGATGGTTC GCCAAGCCTG ATAAGGCGTG CCATTTCGGA TCGAATCCTG
CTGGCGCGCG GGGGGGAGCC GGTCCATTAT CCGGACTTCA TGTTCCAGCA TCGCGAGGCG
GCCAATTTCC CGTGGGTGAG CCAGGCCGAG TGGCTCTATA CCCAGATGGT GCGTTGGGAA
GGCATGGAAT TCGATCCGGA AATGGCGCGC AAGGCGGCGC GCGTGTTCCG GCCCGATGTC
TACCGCAGTG CACTACTGGG GTCGGGAGAG CCTTTGCCCG GGGCAAGTTC GAAGGTCGAG
GGCAGCCTTG GAGGGCTGAC TTCGGTGGGG ACGCAGCAAG GGGTCATGAC GTTGGAAAAC
AATCAGTTTT TCGATGGTCG GGCGTTTGAT CCCCATGATC TTCAGGGCTA TCTCGCAAGT
CAGGCGTGA
 
Protein sequence
MGAELAIGFL PLVDACLPIL AREHGFAEAE GISLKLVRDM SWATVLDRLL YGHSDAAHLV 
APLAIATTLG RGRPAVPLSV PFVLGLNGNA ITMRTALAER VCPAGRLGDP RAVGAALKDV
AAERKAAGNP LRFGVVHRYS SHNYMLRYWL AACGIRPDKD VEIATVAPPF CSDALEAGEV
DVICVGEPWN SVAVERGAGR IVLVTAQIWR RGVEKVLALR EPVLKERRGE VEALLRALVA
AARHFVSPEN WDSNAAILAR PEYLDGSPSL IRRAISDRIL LARGGEPVHY PDFMFQHREA
ANFPWVSQAE WLYTQMVRWE GMEFDPEMAR KAARVFRPDV YRSALLGSGE PLPGASSKVE
GSLGGLTSVG TQQGVMTLEN NQFFDGRAFD PHDLQGYLAS QA
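
The relationship between the gene sequence, the protein sequence, and the GC content field can be reproduced with a minimal sketch in plain Python. The helper names (clean, translate, gc_fraction) are hypothetical and not part of any database tool. The codon table is the standard amino-acid assignment, which translation table 11 shares for internal codons; the alternative start codons that table 11 permits are not handled here, which is harmless for this gene because it begins with ATG.

```python
from itertools import product

# NCBI codon ordering (T, C, A, G); amino-acid assignments shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing above."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGGGCGCGG AACTCGCGAT AGGCTTTCTG"
print(translate(first_blocks))  # MGAELAIGFL
```

Passing the full gene sequence to translate and gc_fraction is one way to compare the result against the listed protein sequence and the 66% GC content value.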