Gene Noc_1956 details

Gene Information

Locus tag: Noc_1956
Symbol:
ID: 3704970
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2241402
End bp: 2243018
Gene Length: 1617 bp
Protein Length: 538 aa
Translation table: 11
GC content: 53%
IMG OID: 637738432
Product: AMP-dependent synthetase and ligase
Protein accession: YP_343948
Protein GI: 77165423
COG category: [I] Lipid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
            [TIGR03098] acyl-CoA ligase (AMP-forming), exosortase system type 1 associated
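
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this) and that the CDS ends in a single stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2241402
end_bp = 2243018
gene_length_bp = 1617
protein_length_aa = 538

# Assumption: coordinates are 1-based and inclusive on the replicon.
span_bp = end_bp - start_bp + 1

# A complete CDS holds one codon per residue plus one terminal stop codon.
codons = gene_length_bp // 3

print(span_bp == gene_length_bp)        # True: 2243018 - 2241402 + 1 = 1617
print(gene_length_bp % 3 == 0)          # True: 1617 = 3 * 539
print(codons - 1 == protein_length_aa)  # True: 539 codons - 1 stop = 538 aa
```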


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.536259
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAAGAA TCCATTCAGC AAATTCCTTA GTACACTCTT TAGTACTCGA TAACGCCCTC 
AAGGGCCCGG ATGCGTCAGC ACTCGTGCAT GGTGACCAAA CTCTAACTTA CGCCTCCCTC
GGTGAAACGG TCGAAGCCTG CGCTCGTGGG CTTCTAGCGC TTGGACTTGC TTCCTCCGAG
CGCGTTGCCA TATATTTGCC CAAACGCCCC GAAACGGTAG TTACCCTCTT CGGCGCTGCA
GCCGCCGGCG GCGTATTTGT ACCTATCAAT CCTTTGCTCA AACCCCGGCA AGTTGCTCAC
ATCCTTCGAG ATTGCAATGT CCGGGTACTC GTCACCGCCA GCAACCGTAT TGATTTTTTA
CAAGATGCGC TTGCTGAATG CCACGATCTG CGAAGCCTCG TCATTGTGGA TGCCCCGACT
CAGACGATTG AAAAATTAGC GCAGCCAATG GCTATCTCCT GGGAGCGTCT CTTATCACTG
GGAACTACTC AACAATCCCC AGGGCATCGC CGTATTGACA GTGATATGGC TGCCATCCTC
TACACTTCTG GAAGCACGGG ACGCCCTAAA GGCGTGGTGC TTTCCCATCG AAATTTAGTC
GCAGGTGCGC AAAGCGTAGC CCAGTATCTG GAAAATAATT CCAACGATCG TCTACTCGCG
GTATTGCCCT TAAGCTTCGA CGCCGGCTTT AGCCAGCTCA CGACCGCCTT TTCTGTCGGC
GCAAGCGTAG TACTAATGGA ATATCTGCTG CCAAAAGATG TCATTAAAAG CATCACTCGC
CATGGGATCA CAGGGATAAC TGCCGTACCC CCCCTCTGGG TCCAACTTGC CTCCCTTGCC
TGGCCCCCCG AAGCCGCGGA TACTCTGCGG TATATTGCCA ATACCGGAGG CCGAATGCCC
AAAGCAGCCA CGACAGCCTT GAGACGATCT TTGCCTCAAA CCAAAGTATT TCTGATGTAT
GGACTCACAG AAGCATTCCG CTCCACCTAC CTTCCTCCTG AAGAAGTTGA TAAACGCCCC
GATTCCATTG GCAAAGCCAT CCCCAACGTA GAAATCCAAG TAGCCCGCGA GGATGGCAGC
CTATGCCTGC CTGGGGAATC AGGAGAGTTG GTACACCGGG GTGTCCTGGT AGCCATGGGT
TACTGGAACG ATCCTAAAAA AACGGCGGAA CGCTTCCGTC CTACTCCAGG GCAACCCCCT
GAACTTCCTC TCACCGAGAT AGCGGTATGG TCCGGTGATA CAGTACGTAT GGATGAGGAC
GGTTTCTTCT ACTTCATCGG CCGCCAAGAC GAGATGATCA AAACCTCCGG CTACCGGGTA
AGCCCAACCG AAGTAGAAGA AGTCCTGTAC CAAGCAGGGC TTGTAGCTGA AGCTGCAGTC
GTGGGTGTGC TCCATCCAAA ACTTGGCCAA GGGATCGTCG CCATAGTAAA ACCAAACAAG
GATAATTTTG ATCCTGAGGA TTTATTGGCT ACTTGTCGCG CCGAACTTCC GAATTTTATG
GTTCCTCTTG CCGTGATAGT TTCCGAGAAT CTACCCCGAA ACACGAATGG TAAGATTGAC
CGGCGCGCAC TCGCCATGGA ATTCGAACTT CTATTCAAGG AACAAACCGC CCCATGA
 
Protein sequence
MSRIHSANSL VHSLVLDNAL KGPDASALVH GDQTLTYASL GETVEACARG LLALGLASSE 
RVAIYLPKRP ETVVTLFGAA AAGGVFVPIN PLLKPRQVAH ILRDCNVRVL VTASNRIDFL
QDALAECHDL RSLVIVDAPT QTIEKLAQPM AISWERLLSL GTTQQSPGHR RIDSDMAAIL
YTSGSTGRPK GVVLSHRNLV AGAQSVAQYL ENNSNDRLLA VLPLSFDAGF SQLTTAFSVG
ASVVLMEYLL PKDVIKSITR HGITGITAVP PLWVQLASLA WPPEAADTLR YIANTGGRMP
KAATTALRRS LPQTKVFLMY GLTEAFRSTY LPPEEVDKRP DSIGKAIPNV EIQVAREDGS
LCLPGESGEL VHRGVLVAMG YWNDPKKTAE RFRPTPGQPP ELPLTEIAVW SGDTVRMDED
GFFYFIGRQD EMIKTSGYRV SPTEVEEVLY QAGLVAEAAV VGVLHPKLGQ GIVAIVKPNK
DNFDPEDLLA TCRAELPNFM VPLAVIVSEN LPRNTNGKID RRALAMEFEL LFKEQTAP