Gene Noc_2858 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2858 
Symbol 
ID3705399 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3236686 
End bp3237636 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content52% 
IMG OID637739334 
ProductD-alanine--D-alanine ligase 
Protein accessionYP_344834 
Protein GI77166309 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1181] D-alanine-D-alanine ligase and related ATP-grasp enzymes 
TIGRFAM ID[TIGR01205] D-alanine--D-alanine ligase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000136858 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAGGGA GGGTGGATGT CAAAGCCTGG GGCAAGGTTG TGGTGTTGAT GGGTGGCTAT 
TCTGCGGAGC GAGAGATTTC TCTTAAAAGC GGTACCGCCG TCCTTCAATC TCTCCTAAGG
CAGGGAATTG AAGCCCATGG AATTGATGTG GATAAAGGAG TGTTAACGCA GCTTTCAAAA
GGGCAGTTTA CCCGGGCCTT TATAGCTCTG CACGGACGCG GGGGAGAGGA TGGTGTTATT
CAGGGAGTTT TGGAAACTTT AAACTTGCCC TATACCGGCA GTGGGGTATT GGGGTCTGCC
CTGACCATGG ATAAGCTGCG TAGTAAGCGG CTTTGGCGGG GAATGGATTT GCCTACCGCA
GATTTTTCTG TGCTCACCAG GGATACAAAT CCAGCTTTAA TTGCGGCTGA CCTAGGTTTA
CCCCTTATTG TGAAACCGGC GCGGGAAGGT TCAAGCTTGG GGATGATGAA AGTAGAAAGC
ATTGAGGCGT TGCAATCTGC TTATAGAGAA GCGGTGATTT TTGATACAGC GGTATTTGCC
GAGCGGTGGC TACCAGGGGC GGAGTATACC GCTGCTATTC TTGCGGACCG AGTCTTGCCG
CTCATTCGTT TGGAGACGCC CCGCGTTTTC TACGATTTCG AAGCGAAATA TCATGCTAAT
ACAACCCGTT ATTTCTGCCC CTGTGGGCTC TCGGAGAAGC AAGAGCAAGA CTTGCAGGCA
CTAGCTTTAG AAGCGTTTCA GGCCCTTGGG GCTAGCGGCT GGGGACGAGT GGATTTGCGC
TGCGATGAGA AAGCACATCC CTATTTGCTT GAGATCAATA CGGTGCCGGG TATGACGGAT
CATAGCTTAG TGCCGATGGC GGCCCAGGCT GCGGGTATTG AGTTCGATGA GATGGTTTTG
CAGATATTGG CAAGTAGTTT GGAGCGGAGA ATGTTCCAGG ATGGCACGTA G
 
Protein sequence
MIGRVDVKAW GKVVVLMGGY SAEREISLKS GTAVLQSLLR QGIEAHGIDV DKGVLTQLSK 
GQFTRAFIAL HGRGGEDGVI QGVLETLNLP YTGSGVLGSA LTMDKLRSKR LWRGMDLPTA
DFSVLTRDTN PALIAADLGL PLIVKPAREG SSLGMMKVES IEALQSAYRE AVIFDTAVFA
ERWLPGAEYT AAILADRVLP LIRLETPRVF YDFEAKYHAN TTRYFCPCGL SEKQEQDLQA
LALEAFQALG ASGWGRVDLR CDEKAHPYLL EINTVPGMTD HSLVPMAAQA AGIEFDEMVL
QILASSLERR MFQDGT