Gene GM21_3083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3083 
Symbol 
ID8138433 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3575519 
End bp3576709 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content60% 
IMG OID644870687 
Productmajor royal jelly protein 
Protein accessionYP_003022869 
Protein GI253701680 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value0.058624 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAAG AAGGTTTGCT ATGGCCGCGG GGAGTCCAGC TGAAGACGAA GCCAAGTACT 
GCAATTGCAA CAATGGCAAG ATCGGTTTGC CTCTGCTTAC TAACAATCTC GATCTGCCTC
GGATGTGCCC ATAAGCAAAC GCCGCAGTCG GACGGGCTGA AGAGCGTCCC GGACCGGCTG
AAGACCATAG CTACCTTCAG GGGAGCACAG GTCACCGGGG TTACTTCGAC CGATACCGGT
AGACTCTTCG CCAATTTCCC CCGATGGCGC GAAGGGGTCC CCTTCTCCGT AGTCGAGGTG
TCGCCTGACG GCTCTTTTAC CCCTTACCCT GACGCGGAGT GGAACCGGTG GGAGGGGTAC
CCGCAGCCCG ATCGTTTCAC CTGCGTGCAA TCGGTGGTGG CGCACGGGGA TTCTCTCTAC
GTGCTTGACC CCAGCAATCC GCAGTTCGCC GGCGTGGTCG GCTCGGCAAA GCTTTTCGTC
TTCGACCTGA AGACGAACCG GTTGAAGCGC AGGTACGAGT TCCACAACGG CGTCGCGCCG
GAAAGATCGT ACCTCAACGA CCTGCGCATC GACGATGCCG CCGGAAAGAT CTATATCACC
GACTCAGGCC TGGGTGCGAT CATAGTTGTC GACACAGCTA CGGGCAACGT CCGCCGGCTT
CTGGCGCACC ATGCTTCAAC CAAGGCTGAG GAGATCACAC TCAGGATCGA CGGCAAGGAG
TTCCTGCGCA ACGGCAAGCC TCCGCGCATC CATTCCGACG GCATCGAGCT CGACCGGAAA
AACGGATACC TCTACTACCA CGCCCTCACC GGCTACCATC TCTACCGGGT CCCCACCAGC
GCGCTCGCGG CGGCGTTTTT CGATCCGAGA CTGGAAGCAG CCCTTGAGGC GAAGGTGGAA
GATCTAGGAA AGACTCCCGC TCCCGACGGG ATGATGTTCG ATGCGGTGGG AAACCTCTAT
ATGGGCGACC TGGAGCACGA TGCCATCGTC TACCGCACCC CGGCCGGTGA GATACTGACG
CTGGTCCAGG ACCCGCGCAT TCGCTGGGCC GACACCTTCA CCATTGATCC AAACGACTCC
CTTATCTTCA CGGCGTCCAG AATTCACCAG GTACCGCAGA GCGGCGGAAT AGAGGAGATG
GAATTTCCGA TCTATTCCCT GCAGCTACCT CCCTCCGCCG CGCCTCAATG A
 
Protein sequence
MAKEGLLWPR GVQLKTKPST AIATMARSVC LCLLTISICL GCAHKQTPQS DGLKSVPDRL 
KTIATFRGAQ VTGVTSTDTG RLFANFPRWR EGVPFSVVEV SPDGSFTPYP DAEWNRWEGY
PQPDRFTCVQ SVVAHGDSLY VLDPSNPQFA GVVGSAKLFV FDLKTNRLKR RYEFHNGVAP
ERSYLNDLRI DDAAGKIYIT DSGLGAIIVV DTATGNVRRL LAHHASTKAE EITLRIDGKE
FLRNGKPPRI HSDGIELDRK NGYLYYHALT GYHLYRVPTS ALAAAFFDPR LEAALEAKVE
DLGKTPAPDG MMFDAVGNLY MGDLEHDAIV YRTPAGEILT LVQDPRIRWA DTFTIDPNDS
LIFTASRIHQ VPQSGGIEEM EFPIYSLQLP PSAAPQ