Gene Noc_2143 details


Gene Information

Locus tag: Noc_2143
Symbol:
ID: 3705335
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2473217
End bp: 2474962
Gene Length: 1746 bp
Protein Length: 581 aa
Translation table: 11
GC content: 59%
IMG OID: 637738619
Product: BNR repeat-containing glycosyl hydrolase
Protein accession: YP_344133
Protein GI: 77165608
COG category:
COG ID:
TIGRFAM ID:
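
The coordinates and lengths in this record are mutually consistent, which a short Python sketch can confirm. It assumes inclusive 1-based coordinates and a CDS that ends with a single stop codon; the function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
length = gene_length_bp(2473217, 2474962)
print(length, protein_length_aa(length))  # 1746 581
```

Both results match the Gene Length and Protein Length fields.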


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.38256
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAAAT CAAGCATAAA GTTATATAAG TCTTTTTTAC TGGGCGCTGC CGCCCTTCTC 
CTGGGAACGG CATCGGCCCA AGGGGCCCAA TGGCGTCCCC TGTGCCCGGA AGAGGTCTCC
TGCGTCAATC CTCAAATTTT TATTAATTCT AATTCCTTAG GCCGGTCGGA GTTGCTGGCC
CCCGCCTTGG CCCCCCTGGT GAGAACGGAT AGCGCGTGGT TGCAGTATTC CGATCAATTC
GAGCCTGATC CCCTGCAACT GCGTCCACGA GGGGTCTTTC GACTTCATCC CCCCCAAACG
GCCCTGGAGA TCCTGCTGCC CCTTAAGGAT TCTCCCCCGG CAACCCCCGA CCCCGATCTG
GGCTCCTCGG ACCTGGAGGA GGTCTTTGAT CCACCTAATA TTCAGCCCAC TTACCTGAAT
GTGTCCGATT TTAACGCCAT GGAGGCCGAG GATGGCAGTA CCCTGGTTTT TGGCCTGTAT
GGGATGCTGC GGTTATCCCC CGAGGGTGGG CTCATCGACG CCCTTAATTG GTCCGGCCGC
AGCGTGCTGG GTCTGAACAA TGCCGCCTTC ACCGGCCCGC CGATTAGAGT GCAGGATTCC
CTTTTTGTAG GGATTCGGGG TCCCGCCCAG CAGTTTATTT ATCACAGTGA GGATGATGGC
CTGACCTGGC AGGAGGAGGT GGCGAGTAAT CGTCTTGGGG ACGATCGCTA TAACCTGCTG
GCCAATCCGG AAGGCACCGG CTTGTGGGCG ATTATCTCCG AGTTTTTTGA CCGCCCGGCG
GAACTGCGGG AATCCCTGGA TTTAGGCGCC ACCTGGAACC GAGTAGACAA TGGCAGTTTC
CCAGCCCATA CAGTGCGGGT GGTCCATGAT CCGGGCGATC CCCAGGTGGC CTACGCCCTC
TCGGCGCAAG GCTTATACCG GAGCCAGGAT CGGGGGGTAT CCTGGCACTT GACCGCCTTA
CAGGAGCCGG TCCATGGGCT GGTTTTTGTA CCCCAAAAGG CGCCTTTGCC ACCGCTGCTG
GTGGCGGGCA CCGACACAGG CGTTAAAATC AGTCCAGAGC CGTTCGGTAC TTGGGAAGCC
TTGAGCAATG GTCTGCTGGC TATTCCCCAT ACGGTGGTCT ATACGGACGG CCTGCTGATA
GGCGTCAGCG CTGCCGGCTA TTTTGTTTGC CCCCAGGCCG ATTGCTTTGG GGAGAGTCAA
GCCGTGCCAG CCGAAGAGGC GCGGGGCGAA GTGACCGTGA CGGAATTTTT CAATGTCGAT
TTAGGCCATT ATTTCATGAC GGCCTCCCCA GAGGACGTTG CTATCATTGA GGCCGGCGGA
GCAGGGCCCG GCTGGGAACG TACCGGTCAT ACCTTTAAAG CCTGGAGTAA TTTAGGCAGT
GACGTGGGGG TGTATCTCTG CCGTTTCTAT GGCTCCGTCT CCCCCGGCCC CAACAGCCAT
TTCTTTACCG CCTCTCCCCA GGAGTGTGGT TTTTTGCTGG ACCTGCAAGC CCAAACGCCC
CCCACCGTGC CCCGCTGGAA CTTCGAGGGC GATGCCTTTA TGGCCATCCC CGCCCAGGGC
AAGGGGGACG CGCAGCATTG TCCGGAGGCA TTCGTTCCCG TCTACCGGGC CTATAACAAT
GGCTTTGCCC GAGGAGAGGA GAGTAACCAC CGCTTTGTGA CCGACCGGAC CTTGCTCACG
CCCTTATTAG ACCAAGGCTG GGTAGATGAA GGCATTGCGT TCTGCGTGCC ACCCCAATCC
CAATAA
 
Protein sequence
MAKSSIKLYK SFLLGAAALL LGTASAQGAQ WRPLCPEEVS CVNPQIFINS NSLGRSELLA 
PALAPLVRTD SAWLQYSDQF EPDPLQLRPR GVFRLHPPQT ALEILLPLKD SPPATPDPDL
GSSDLEEVFD PPNIQPTYLN VSDFNAMEAE DGSTLVFGLY GMLRLSPEGG LIDALNWSGR
SVLGLNNAAF TGPPIRVQDS LFVGIRGPAQ QFIYHSEDDG LTWQEEVASN RLGDDRYNLL
ANPEGTGLWA IISEFFDRPA ELRESLDLGA TWNRVDNGSF PAHTVRVVHD PGDPQVAYAL
SAQGLYRSQD RGVSWHLTAL QEPVHGLVFV PQKAPLPPLL VAGTDTGVKI SPEPFGTWEA
LSNGLLAIPH TVVYTDGLLI GVSAAGYFVC PQADCFGESQ AVPAEEARGE VTVTEFFNVD
LGHYFMTASP EDVAIIEAGG AGPGWERTGH TFKAWSNLGS DVGVYLCRFY GSVSPGPNSH
FFTASPQECG FLLDLQAQTP PTVPRWNFEG DAFMAIPAQG KGDAQHCPEA FVPVYRAYNN
GFARGEESNH RFVTDRTLLT PLLDQGWVDE GIAFCVPPQS Q
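
The relationship between the two sequences can be sketched in Python: strip the spacing used for display, then translate codon by codon. This is a minimal sketch, not the method used by the source database. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. The codon table holds the amino acid assignments of NCBI translation table 11; the table's alternative start codons are not handled, which is harmless here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acid assignments of NCBI translation table 11, codons ordered TTT, TTC, TTA, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGGCAAAAT CAAGCATAAA GTTATATAAG"))  # MAKSSIKLYK
```

The printed residues match the start of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` gives values to compare against the protein sequence and the GC content field.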