Gene MCA1449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1449 
SymbolpqqE 
ID3105037 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp1540197 
End bp1541318 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content61% 
IMG OID637170625 
Productpyrroloquinoline quinone biosynthesis protein PqqE 
Protein accessionYP_113907 
Protein GI53804506 
COG category[R] General function prediction only 
COG ID[COG0535] Predicted Fe-S oxidoreductases 
TIGRFAM ID[TIGR02109] coenzyme PQQ biosynthesis protein E 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0775464 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGGAT CAGAGAAATC ATCGCTTACT AAACCGCGCT GGCTGCTGGC GGAGCTGACC 
TACGCCTGCC CGCTGCAGTG TCCCTATTGC TCCAACCCCC TGGATTACGC CCGCCTGGGT
GACGAGCTGA GCACCGAAGA ATGGAAGCGG GTGCTGAGTG AGGCCCGCGC GCTCGGTGCC
GTCCAGCTGG GGCTTTCCGG CGGTGAACCG CTGACCCGCC GCGACTTGGC CGAAATCGTC
ACCCACGCCC GCCAGCTCGG CTATTACACC AACCTCATCA CCTCGGGCTA CGGCCTGGAC
GAAGTCCGCA TCGCCGAATT GAAGTCGGCC GGCCTCGACC ACATCCAGGT CAGCATCCAG
TCGCCGGAAA AGCTGCTGAA CGATGAACTC GCCGGCACCG AGTCTTTCGA ACACAAACTC
AAGGTGGCCC GCTGGGTGAA GCAGCATGGC TATCCCATGG TCTTGTGCGT GGTGATCCAC
CGCCAGAACA TCCATCAGAT GCAGCAGATT TTGGAGATGG CGGACGAACT CGGGGCGGAT
TACCTGGAAC TGGCCAACAC CCAGTATTAT GGTTGGGCCC TGCTCAACAG GGACCATCTG
CTGCCGACCC GTGAGCAGTT CGCCGAAGCC GAGGCGATCG CGCAAAGCTA CAAGGAGAAG
GTGAAGGGAC GGATGAAGAT CTACTACGTC GTCCCTGACT ACTACGAAGA CCGGCCCAAG
GCCTGCATGA ACGGCTGGGG CACGACATTC CTCACCATCG CGCCGGACGG GATGGCCCTG
CCCTGCCACG CAGCCCGCGA ACTACCCGGG CTGAACTGCC CCAGCGTACG CGACTTCAGC
ATACGGGAAA TCTGGTACGA ATCGGCCGCC TTCAATCGTT TCCGCAGCTA CGGTTGGATG
AAGGAACCCT GCCGCAGTTG TCCGGAGAAG GAAAAAGACT TCGGCGGTTG CCGCTGCCAG
GCCTATCTCA TGACCGGCGA CATGGCCGAC GCCGACCCCG TGTGCAGCAA ATCCCCGCAC
CATCATCGCG TGCTGGAAGC CATTGCGTCG ACACAGCGAT CTGCGAGCGA CAAACCGCTG
TTCTTCCGCA ATGCCAGGAA CTCCCGAGCC TTGACGGGCT GA
 
Protein sequence
MAGSEKSSLT KPRWLLAELT YACPLQCPYC SNPLDYARLG DELSTEEWKR VLSEARALGA 
VQLGLSGGEP LTRRDLAEIV THARQLGYYT NLITSGYGLD EVRIAELKSA GLDHIQVSIQ
SPEKLLNDEL AGTESFEHKL KVARWVKQHG YPMVLCVVIH RQNIHQMQQI LEMADELGAD
YLELANTQYY GWALLNRDHL LPTREQFAEA EAIAQSYKEK VKGRMKIYYV VPDYYEDRPK
ACMNGWGTTF LTIAPDGMAL PCHAARELPG LNCPSVRDFS IREIWYESAA FNRFRSYGWM
KEPCRSCPEK EKDFGGCRCQ AYLMTGDMAD ADPVCSKSPH HHRVLEAIAS TQRSASDKPL
FFRNARNSRA LTG