Gene Noc_2943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2943 
Symbol 
ID3706425 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3330777 
End bp3331817 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content53% 
IMG OID637739420 
Productcholoylglycine hydrolase 
Protein accessionYP_344918 
Protein GI77166393 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3049] Penicillin V acylase and related amidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCGAC AGATGATTTA CCGAACCGTG GCAAGTGCAG CTATTTTCGT GGTTGTGGGA 
CTTGTTTTCT TGCAGGCCGC CGATGCCTGT ACGCGTGTTC TTTGGAACGA CAGCGGGCTG
AACGTCGTGG TGGGGAGGAC GATGGACTGG CCGGAGTCTA CCCAGCCGGA GATCGTGGTC
TTTCCACGTG GAATGAAACG GGATGGGGGC CTCCTCGGTA CGGAGACTGC GGTCAAGGTG
AATCCAGCCA AGTGGACATC CAAATATGCC AGTATGGTGG TCCCAGTTTA CGGCATTGGC
ACCGCTGACG GCTTTAATGA AGCCGGGCTA GCAATTCACA TGCTGTACCT CGAAAATACG
GATTTTGGGC CACGCGATCC CAGCAAGCCG GGTGTACAGG CTGGTCTATG GGGGCAGTAT
GCACTGGACA ATGGGGCGAC AGTTGATCAA GCATTGCCCC TGCTCAAGAA GATCCAGCCG
GTGATGGTCG AGATGCACGG ACACAAGGCC ACGGTTCACC TGGCCTTGGA AGATGCCACG
GGTGATTCTG CTATCCTCGA GTACATTAAC GGCAAGCTGG TCATTCATCA TGGCCGTCAA
TATCGGGTCA TGACCAACGA TCCAAGCTAT GATCAGCAAC TCGCGTTGCT GCAGAAGATG
AAAAAAGAGG TTGATTTCAC GCATCCAAGC AGTAACACCC CGTTACCCGG CAATGTCAGT
GCTACGGATC GTTTCCAGCG GGCGTCTTAT TTCTCGGCAT TGTTACCCAA GCCGAAGGAC
GAACGCGAGG AAGTCGCTTC TATACTGTCC ATTATGCGCA ACGTATCGGT GCCATTTGGT
GCACCCTACC AGAGCTTTGG TATCTACAAT ACTGAGTACC GTACGGTGAC CGATCTCGAC
ACTAAGCGCT ACTATTTTGA ATTGACGACT GCACCGAATG TGATCTGGGC AGATCTTACG
AAATTTGACC TGAAACCCAG GTGCACCGGT AATGGTGCTA AATCCGGACA ATATCGGCCT
GAGTGGGAAT GTCACGGATA G
 
Protein sequence
MNRQMIYRTV ASAAIFVVVG LVFLQAADAC TRVLWNDSGL NVVVGRTMDW PESTQPEIVV 
FPRGMKRDGG LLGTETAVKV NPAKWTSKYA SMVVPVYGIG TADGFNEAGL AIHMLYLENT
DFGPRDPSKP GVQAGLWGQY ALDNGATVDQ ALPLLKKIQP VMVEMHGHKA TVHLALEDAT
GDSAILEYIN GKLVIHHGRQ YRVMTNDPSY DQQLALLQKM KKEVDFTHPS SNTPLPGNVS
ATDRFQRASY FSALLPKPKD EREEVASILS IMRNVSVPFG APYQSFGIYN TEYRTVTDLD
TKRYYFELTT APNVIWADLT KFDLKPRCTG NGAKSGQYRP EWECHG