Gene Noc_1967 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1967 
Symbol 
ID3705426 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2253828 
End bp2254847 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content50% 
IMG OID637738443 
Productpolysaccharide deacetylase 
Protein accessionYP_343959 
Protein GI77165434 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0726] Predicted xylanase/chitin deacetylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCCAGAAG AAATTCTATC TGGTTTTGAC AAGGTGGTAT TGGGTGGACT ACGTCCCTTA 
GCTTCATTAC TTGCCTCCAG CGGCAAACGG GCACGCCTTT CGGTGCTGAT TTATCATCGT
GTGCTAGCGG CGCCAGACGC CCTGCTTCCG GGTGAGCCAG ATGCCGTGCA ATTTCGTTGG
CAGATGGAAC TCTTGGCGCG TTATTTCAAT CCACTCCCTC TATCAGAAGC CGTTAGGCGC
TTACGGGAAG AAACTCTACC TCCGCGGGCA GTTAGCGTAA CTTTTGATGA TGGTTACGCG
GATAACGTGG AGGTTGCATT GCCTATTTTG CAGCAAGTGA AGGTGCCAGC AACATTTTTT
ATCGCTACCG GCTATTTAGA TGGCGGGTGG ATGTGGAACG ATAAGGTCAT TGAAACGCTG
CGCCATATGC CTAATGGAGA TTTTGATCTA ACAGATATGG GGCTAGACAT CTACCCTTTA
GTAAGTACTG TAGATCGTTT GGCGGCGATT GAACAAATCC TTGTCCGACT AAAATATCTT
CCCAGCGAGG AGCGGGAGGA GCAGGTTCAA GCGCTCACTC AGCGTTGCCA AAAATCACTT
TCCCATAGCT TGATGATGAC TTCTATCCAA GTGCGCCAGC TTCACCAGGC AGGGATGGAG
ATTGGAGCCC ACACTCATAC CCATCCTATT TTAGCATCAC TGGGACGGGC ATCCGCAGAA
CAAGAAATGG CGACAAGCAA GGCTCGACTA GAAGATCTGC TGGGCGAATC GGTGCGTTTA
TTTGCCTATC CTAACGGTAA ACCTGGAAAA GACTACCTTC AGGAACATGC GGATATTGCT
CAAAATCTAA ATTTTGAAGC TGCGGTTTCT ACCGCCTGGG GCGTTGCTTC GGCTCAAAGT
GATATTTGGC AATTACCACG CTTCACGCCT TGGGATCGCA CCCCAAGCCG ATTTATGTTG
CGTCTCTTGT GGAACTACCG CAGTGCTGGT ACGCCGTTTG CTTATTCCGA AGCGGAATGA
 
Protein sequence
MPEEILSGFD KVVLGGLRPL ASLLASSGKR ARLSVLIYHR VLAAPDALLP GEPDAVQFRW 
QMELLARYFN PLPLSEAVRR LREETLPPRA VSVTFDDGYA DNVEVALPIL QQVKVPATFF
IATGYLDGGW MWNDKVIETL RHMPNGDFDL TDMGLDIYPL VSTVDRLAAI EQILVRLKYL
PSEEREEQVQ ALTQRCQKSL SHSLMMTSIQ VRQLHQAGME IGAHTHTHPI LASLGRASAE
QEMATSKARL EDLLGESVRL FAYPNGKPGK DYLQEHADIA QNLNFEAAVS TAWGVASAQS
DIWQLPRFTP WDRTPSRFML RLLWNYRSAG TPFAYSEAE