Gene Noc_1151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1151 
Symbol 
ID3706916 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1258904 
End bp1260103 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content50% 
IMG OID637737655 
ProductO-succinylhomoserine sulfhydrylase 
Protein accessionYP_343185 
Protein GI77164660 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases 
TIGRFAM ID[TIGR01325] O-succinylhomoserine sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTAACCA ATGATCAAAG CAAGTACGAT TTTGCTACGC TGGCAGTGCG AGCGGGGCAG 
CAACGTACTG GTGAGGGTGA ACATGCCGAG CCTATATTTC CTACTTCCAG CTTTGTGTTT
GAAAGTGCTA CAGAAGCTGC TGCTTGTTTC GCAGGAGAAA TAGCAGGAAA TATTTATTCC
CGATTTACTA ACCCTACGGT CCGGACTTTT GAGCAACGCC TTGCAGCTTT AGAAGGGGGT
GAGCGTTGCG TGGCGACTTC ATCGGGGATG GCGGCTATCC TTGCGACTTG TATGGCGCTG
TTGAAGGCAG GAGATCATAT CGTTTCCTCG GAGAATATTT TTGGAACGAC GCGGGTTCTC
TTCAATAAAT ATCTGGCCCG CTTTGGCGTA GAGACCACCT TTGTTCCTCT CATTCAGTTG
GAAGCCTGGG AAAACGCCTT GCGTCCCAAT ACTCGCTTAC TATTTCTGGA AACGCCTTCC
AACCCTCTGA ATGAAATAGC AGATATCGTT CAACTATCCA GTCTGGCGCA GGCCCATGGT
TGCTTATTAG TGGTGGATAA TTGTTTTTGT ACTCCTGCTT TGCAGCGCCC TTTCGAGCTA
GGGGCGGATC TTGTTATTCA CTCTGCCACC AAGTACCTCG ATGGCCAAGG GCGGTGCGTG
GGGGGCGCCG TAGTGGGTGA TGGGCAACGG GTAGGAGAGG AAATCTTTGG CTTTTTGCGT
ACTGCGGGTC CAACAATGAG TCCCTTCAAT GCCTGGGTTT TTCTTAAAGG TTTAGAAACC
TTGCAATTGC GGATGGAAGC GCTGAGTCGA CAGGCTCAGG CTTTGGCCGA ATGGTTGGAA
GCGGAGCCAA AAGTTTCAAG GGTATATTAT GCAGGTTTGC CTTCCCATCC TCAACACACG
CTGGCCTCGA AGCAACAGTC GGGCTTTGGT GGCCTGGTCG CATTCGAGCT AAAAGGAGGG
AAGGCGGCCG CCTGGAAACT TATCGATTCT CTCAAGTTTA TCTCTATTAC TGCTAATCTT
GGGGATGTGA AAACCACCAT TACTCACCCG GCTACCACGA CCCATGGTCG TTTAACGGAG
GAGGAGCGAT TGGCAGCAGG TATCAGCGAT GGTTTGGTAC GAATCTCCGT GGGTTTAGAG
TCCCTTGAGG ATATTAAAAA AGATTTACAG CGGGGTTTGG ATAGGATGGC CCAAGGTTGA
 
Protein sequence
MLTNDQSKYD FATLAVRAGQ QRTGEGEHAE PIFPTSSFVF ESATEAAACF AGEIAGNIYS 
RFTNPTVRTF EQRLAALEGG ERCVATSSGM AAILATCMAL LKAGDHIVSS ENIFGTTRVL
FNKYLARFGV ETTFVPLIQL EAWENALRPN TRLLFLETPS NPLNEIADIV QLSSLAQAHG
CLLVVDNCFC TPALQRPFEL GADLVIHSAT KYLDGQGRCV GGAVVGDGQR VGEEIFGFLR
TAGPTMSPFN AWVFLKGLET LQLRMEALSR QAQALAEWLE AEPKVSRVYY AGLPSHPQHT
LASKQQSGFG GLVAFELKGG KAAAWKLIDS LKFISITANL GDVKTTITHP ATTTHGRLTE
EERLAAGISD GLVRISVGLE SLEDIKKDLQ RGLDRMAQG