Gene Noc_2623 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2623 
Symbol 
ID3704514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2978192 
End bp2979349 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content51% 
IMG OID637739104 
Productpyrroloquinoline quinone biosynthesis protein PqqE 
Protein accessionYP_344606 
Protein GI77166081 
COG category[R] General function prediction only 
COG ID[COG0535] Predicted Fe-S oxidoreductases 
TIGRFAM ID[TIGR02109] coenzyme PQQ biosynthesis protein E 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGGCT CTCCACCGAA AATTAATCAT TTTACTGGCA ACGCAAGAAC CCAGCCTCTA 
TGGCTGCTAG CGGAGCTTAC CTATGCCTGC CCCCTGCAAT GTCCGTACTG TTCTAACCCC
TTAGACTTTG CTAACTACAA ACATGAACTC TCTACGAAGG ACTGGTTGCG GGTATTCCAT
GAGGCCCGGG CTATGGGGGC CGCTCAGCTA GGATTCTCTG GCGGTGAACC TCTGGCTCGG
CGTGATCTGG AAGTGCTCAT CACCGAAGCC CGTAAACTTG GCTATTACAC TAATCTCATT
ACTTCCGGCA TCGGCATGGA TGAAGACCGA ATTGCGGCTT TTAAAACCGC TCGTCTCGAT
CACATCCAAA TTAGTTTCCA GGCTGCTTCC GAAGATCTCA ATAATCGGCT TGCCGGCGCG
GATGTTTTTC AGTATAAACT GGCCATGGCC CGGGCCGTGA AAAAACACGG CTATCCCATG
GTGCTATGCT TTGTTCTCCA TCGCTATAAT ATTGACCAGA TAGGCAAAAT TCTGGATCTT
GCCATCGAAC TCAAAGCCGA CTATGTGGAA TTGGCCACTA CCCAGTATTA TGGCTGGGCC
TGGCATAACC GAAACCACTT ACTCCCCACC AGGGAACAGC TAGAACGGGC TGAGGCCCTA
GCCCGCCAGT ACCAAGTCCG CACACAGGGA AAAATGAAGA TTTATTATGT GGTGCCGGAT
TATTACGAGA ACCGCCCCAA GGCCTGCATG AATGGTTGGG GAAATATTTT TCTCACCATC
GCGCCGGATG GAACCGCCCT CCCCTGCCAT GCTGCCCGCC AGCTACCCGG ACTTACCTTA
CCCAATGTAA AAAGCCATAG TATTGAGTGG ATCTGGTATG AATCACCGGA CTTTAATTTA
TTCCGGGGCC AAGGATGGAT GAAGGAACCT TGCCGAAGTT GCCCCGAACG TTTCAAGGAC
TTTGGGGGAT GCCGCTGTCA GGCCTATTTA TTGACGGGCG ATGCTCGTAA TACCGATCCC
GTCTGTGATC TATCCCCCCA CCATCAGACT GTGGTAGACG CTATTACCGC AGCTCACCAG
CAAACACCTT TACCGGCAAA CAAGAGCAAA CCACCGATAT TCCGTCATCT ACGAAATTCT
AAAAAATTCT GTGGCTAA
 
Protein sequence
MAGSPPKINH FTGNARTQPL WLLAELTYAC PLQCPYCSNP LDFANYKHEL STKDWLRVFH 
EARAMGAAQL GFSGGEPLAR RDLEVLITEA RKLGYYTNLI TSGIGMDEDR IAAFKTARLD
HIQISFQAAS EDLNNRLAGA DVFQYKLAMA RAVKKHGYPM VLCFVLHRYN IDQIGKILDL
AIELKADYVE LATTQYYGWA WHNRNHLLPT REQLERAEAL ARQYQVRTQG KMKIYYVVPD
YYENRPKACM NGWGNIFLTI APDGTALPCH AARQLPGLTL PNVKSHSIEW IWYESPDFNL
FRGQGWMKEP CRSCPERFKD FGGCRCQAYL LTGDARNTDP VCDLSPHHQT VVDAITAAHQ
QTPLPANKSK PPIFRHLRNS KKFCG