Gene Noc_2145 details

Gene Information

Locus tag: Noc_2145
Symbol:
ID: 3705337
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2477708
End bp: 2478979
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 53%
IMG OID: 637738621
Product: L-sorbosone dehydrogenase
Protein accession: YP_344135
Protein GI: 77165610
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2133] Glucose/sorbosone dehydrogenases
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAAAC CACGCAATAA CGCCCTGGCT TGGGGCGTAT GGATACTATT CTTTTGGTTC 
TCTTTCTCTT CTATCACTCT GGCGCAACAG GTTTGCGACT CAGAAAATGA AGGGCTTACT
CTGCCTAAGG GTTTTTGTGT CCTTATGGTG GCGGATAAAG TAGGGAAGGC CCGCCATTTG
ACAGTGGCCC CCAATGGCGA TGTCTTCGTC GCTATTGGCG CAACCAAGGC ATCGCCAGGG
GGTGTGTTGG CGCTACGAGA TACCACAGGC GACGGTGTTG CCGATTTAAA AAAGCGTTTT
GGCAGTGGTC CTGGCGATGA TGTGGAATTT TATGATGGTT ACCTTTATTT CGCGACTCAC
GAAAAAATCG TGCGCTATCC TTGGCGTAGT GGAGATTTAG AACCGGCGGG GCCCGCCGAG
ACTATCGTTG AAAAGCTCCC TGCCGCTGCT AGCCATCGAG CTAAAAGTAT CGCTTTTAGT
CCTGAAGGCA AGCTTTATGT CAATATCGGT TCGCCCTCCA ACGCCTGCCA GAAACAGGAC
CGCACCGCCG GCTCACCGGG AAAGGAGCCT TGCGATGAAC TTGTTACCCG TGCTGGAATC
TGGCGTTTTG AGGCCGGCCA ACCCAACCAA GCCCAGCAAG ATGGTAGCCG TTTTGCCACA
GGGCTTCGTA ATACGGTCGC TTTAGCTTTA CGCCCCCAGG ATGGCCAGTT GTACGGCGTC
ATTCATGGCC GCGATCAGCT AAGTTTGTGG CCTCACTTCA ATGATAGCCA GAATGCAGAA
AAACCCTCGG AGGAGTTGGT GCGCATCCAG GAAAATAACG ATTTTGGTTG GCCCTACTGC
TATCATGATC CCGCCCTTAA CCAGAAAGTT CTGGCCCCCG AGTATGGCGG AGATGGAAAA
ACCGTGGATC GCTGCCAGAA AAAACAAGAT CCGCTGCTGG CCTTGCCCGC CCATTGGGCA
CCTAATGGGC TCCTCTTTTA TTCTGGCGAA CAGTTCCCAG AACGGTATCG GGGCGGGGCT
TTTATTGCCT TCCATGGTTC CTGGAACCGG GCGCCATTGC CCCAGGGGGG TTATAAGGTT
GTTTTTGTTC CTTTCAAGGG AAAGGAACCC ACGGGCGAAT GGGAGGTATT TGCCGAGGGT
TTTGCCGGTC AACATAAGAC TCCCCGCGCT GCCGAGCATC GACCAGTGGG TGTTGCCGAA
GGTCCGGAGG GCTCCCTTTA TATTAGTGAT GATCAGGGAG GTCGTATCTA TCGTGTTTTC
TATAGGCCAT AG
 
Protein sequence
MAKPRNNALA WGVWILFFWF SFSSITLAQQ VCDSENEGLT LPKGFCVLMV ADKVGKARHL 
TVAPNGDVFV AIGATKASPG GVLALRDTTG DGVADLKKRF GSGPGDDVEF YDGYLYFATH
EKIVRYPWRS GDLEPAGPAE TIVEKLPAAA SHRAKSIAFS PEGKLYVNIG SPSNACQKQD
RTAGSPGKEP CDELVTRAGI WRFEAGQPNQ AQQDGSRFAT GLRNTVALAL RPQDGQLYGV
IHGRDQLSLW PHFNDSQNAE KPSEELVRIQ ENNDFGWPYC YHDPALNQKV LAPEYGGDGK
TVDRCQKKQD PLLALPAHWA PNGLLFYSGE QFPERYRGGA FIAFHGSWNR APLPQGGYKV
VFVPFKGKEP TGEWEVFAEG FAGQHKTPRA AEHRPVGVAE GPEGSLYISD DQGGRIYRVF
YRP
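
The length, GC content, and translation fields in this record can be cross-checked against the sequences with a few lines of code. The following is a minimal Python sketch, not the database's own method. The helper names (clean, gc_percent, translate, expected_protein_length) are hypothetical and chosen only for illustration. It assumes the coding sequence starts with ATG, as this gene does, so the alternative start codons that translation table 11 allows do not need special handling.

```python
from itertools import product

# Translation table 11 (bacterial) assigns the same amino acids to codons
# as the standard genetic code; it differs in which start codons it permits.
# The string below lists amino acids in TCAG codon order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: not part of the protein
            break
        protein.append(aa)
    return "".join(protein)


def expected_protein_length(gene_length_bp):
    """Number of codons in the gene minus the terminal stop codon."""
    return gene_length_bp // 3 - 1


# First line of the gene sequence from this record (60 bp, 20 codons).
first_line = "ATGGCAAAAC CACGCAATAA CGCCCTGGCT TGGGGCGTAT GGATACTATT CTTTTGGTTC"
print(translate(first_line))
print(expected_protein_length(1272))
```

Passing the full gene sequence to translate and gc_percent, in place of first_line, allows the same comparison against the complete protein sequence and the reported GC content.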