Gene Noc_1016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1016 
Symbol 
ID3707277 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1125035 
End bp1126057 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content52% 
IMG OID637737521 
Productaspartate-semialdehyde dehydrogenase, USG-1 related 
Protein accessionYP_343054 
Protein GI77164529 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0136] Aspartate-semialdehyde dehydrogenase 
TIGRFAM ID[TIGR01296] aspartate-semialdehyde dehydrogenase (peptidoglycan organisms) 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAGAA CATTTGATGT GGCTGTCGTT GGAGCTACCG GCGCGGTGGG GCAGGCTATG 
ATGGAAATCC TGGCCCAGCG GGGCTTCCCC GTCAGCCGGG TGTACCCCTT GGCGAGCGAG
CGCTCCGCTG GGGAAAAACT TTCGTTTGGG CGCGATGAAG TCATCGTGGA AAACCTGGCT
GGCTTTGATT TTTCTAAGGT ACAGCTTGGC TTGTTTTCCG CCGGCGCTAA AATCTCGGCC
GAGTATGCGC CTAAAGCAGC GAGTGCGGGT TGCGTGGTGG TGGATAATAC TTCCCAGTTT
CGCTATGACA ATCGGATTCC TTTAGTGGTG CCCGAGGTCA ATCCCCAGGC AATTGAGGGT
TATAAGGACC ATGGGATCAT TGCCAATCCC AATTGTTCGA CTATCCAGAT GTTGGTGGCT
CTTAAGCCCA TCTATGATGC GGTGGGTATC GAGCGAATCA ATGTAGCCAC TTACCAGGCT
GTTTCTGGCA CCGGTAAAGA AGCTATCGAG GAGTTGGCAA AGCAAACCTC CACCCTGCTG
AATGGGCGGC CTATTTCGCC TCAAGCTTAC CCCAAGCAGA TTGCCTTTAA TGTATTGCCC
CATATCGATG ATTTTCTGGA TAACGGTTAC ACCCGCGAAG AGATGAAAAT GGTATGGGAG
ACACGAAAAA TTTTTGGTGA CGAGTCTATC TTGGTAAATC CAACCGCGGT GCGGGTGCCG
GTTTTTTATG GGCATTCGGA AGCCGTGCAT CTGGAAACCC GTGATAAGAT CACTGCCGAT
GAGGTTAAGG CGTTGCTGCA GCAGGCGCCT GGGGTTACGG TTTTGGATGA GCACACCAAT
GGAGGATATC CCACGGCGGT TACCGAAGCT TCAGGCAGGG ACCCAGTATT CGTGGGGCGT
ATCCGGGAAG ATATTTCCCA TCCCAAGGGT ATAGACCTCT GGATTGTAGC CGATAACGTC
CGCAAGGGCG CGGCGTTAAA TAGCATCCAA ATTGCGGAAT TGCTAATCAA GGATTACCTA
TAA
 
Protein sequence
MSRTFDVAVV GATGAVGQAM MEILAQRGFP VSRVYPLASE RSAGEKLSFG RDEVIVENLA 
GFDFSKVQLG LFSAGAKISA EYAPKAASAG CVVVDNTSQF RYDNRIPLVV PEVNPQAIEG
YKDHGIIANP NCSTIQMLVA LKPIYDAVGI ERINVATYQA VSGTGKEAIE ELAKQTSTLL
NGRPISPQAY PKQIAFNVLP HIDDFLDNGY TREEMKMVWE TRKIFGDESI LVNPTAVRVP
VFYGHSEAVH LETRDKITAD EVKALLQQAP GVTVLDEHTN GGYPTAVTEA SGRDPVFVGR
IREDISHPKG IDLWIVADNV RKGAALNSIQ IAELLIKDYL