Gene Noc_2734 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2734 
Symbol 
ID3705370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3104405 
End bp3105589 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content53% 
IMG OID637739216 
Productglycoside hydrolase family protein 
Protein accessionYP_344717 
Protein GI77166192 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1449] Alpha-amylase/alpha-mannosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATTA TCCACCACGC CCTTGTCTTA AACTTACATC AACCAGCAGG CAATCTGGAG 
CATTTACTGG AACATGCCCC CTGGGAGGCT CAAGAAATTC TCTATGCCCT GGACCGGATA
CCCCGCTCCC TTTGGGGATA CGAGGATATC GCCCGGGTTC ATCTAGCCTT CTCGGGTACG
CTCCTTGAAA CCCTCTCCAA TCCCGGCTTT CAGCAACGGG TCTATGGGAT TATCAAATGC
GGCGACTTGC TGTGGCATCT CCAGAATACC GCCTTATTTA ACCTTCTGGG GAGCGCTTAC
TATCACCCTG CTCTCCCCCT GATTCCTGGC CCGGATCGGG AAGAGCATAT CAAGCGCTGG
CTAGGTATCG GGCGGCATCT TTTTAGCCAA ACTCGCTTCA GCGGGTTTTG GCCGCCAGAG
ATGGCTTTTA CCATGGAATT GATCCCCCTG CTGAAGCAAC ATGGGTTCCG TTATGTCCTG
GTAGATAGCC TCCATGTGGA GCCTTTAGAG GAGATGAGTT GGCAGGAGCT CCGCTACCGC
CCCCATATTG CCGAGTACGG GGGGGAAGAA ATCATTGTGG TGGTCCGGGA CCGGGAACTC
TCAGATGCCC AGGAAGCCGG AATGGAGTAT GACTGGTTTC TAAATGAGCT TTATCACCGC
ACCCGGCCGT GTGACTTCCC GCCCCTGGTT ACCACCTGTA GCGATGGAGA CAATGGCGGC
TGGTTCCGTA ATACTTCCGA GGAAAGTAAT TTTTGGGGAA CATTTTACCG GGATTTTTTA
ACAGGGGCGC GCCGGGATAA TGCTATTCTC CGCCCCGCTT TCATCCAGGA TTACCTGGAC
AAATACGGCG CCAAGGGCCG AGTTAGGATA ACAACCGCAG CCTGGAATAC TGGCGATCAC
TCAGGGATTG ATTTTATCCA GTGGACCGGC TCCCAGTGCC AAAAAGAGGC CCTTAAACGG
GTAGAAGAAA CCAGTACGGC CTTCCATCAA CTAAAGCAAA AACATCTTGG CGCTCAAGCC
CTGAACCCGG AAATCGCCCA TCTTCTCAAT GAGGCGGAGT GGCACCTGCT ACGCTCGGAA
ACCAGTTGCC ACTTTTATTG GGGAGAAGCA TGGGTTCATC GCGCCCATGA AGATTTGGAT
ACCGCCTGGA CGCAAATGAA CACCGCAGCC CAGAAACTTA AATAA
 
Protein sequence
MSIIHHALVL NLHQPAGNLE HLLEHAPWEA QEILYALDRI PRSLWGYEDI ARVHLAFSGT 
LLETLSNPGF QQRVYGIIKC GDLLWHLQNT ALFNLLGSAY YHPALPLIPG PDREEHIKRW
LGIGRHLFSQ TRFSGFWPPE MAFTMELIPL LKQHGFRYVL VDSLHVEPLE EMSWQELRYR
PHIAEYGGEE IIVVVRDREL SDAQEAGMEY DWFLNELYHR TRPCDFPPLV TTCSDGDNGG
WFRNTSEESN FWGTFYRDFL TGARRDNAIL RPAFIQDYLD KYGAKGRVRI TTAAWNTGDH
SGIDFIQWTG SQCQKEALKR VEETSTAFHQ LKQKHLGAQA LNPEIAHLLN EAEWHLLRSE
TSCHFYWGEA WVHRAHEDLD TAWTQMNTAA QKLK