Gene GM21_4105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_4105 
Symbol 
ID8139479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4686818 
End bp4688152 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content65% 
IMG OID644871720 
Producthypothetical protein 
Protein accessionYP_003023878 
Protein GI253702689 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones117 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGTTA CCATCTTCAC ATCCCCCATT ACCCCCATCA GGAGCGGCCG TGATTCCCGC 
AGAATGGCCG CGCCGCCACC TTCCCTCGCC CGCAGCCTAA GGCGCCTTGC CCGCTTTTGG
CTTGCAGCCT GCCTGGCACT GCTCCTGCTC GTTCCCCCCG GCGCCCACGC TGCTCCTTCG
GAAACCTCCG CCACCCTCTA CTTCTTCTGG GGCGTCGGCT GCCCCCACTG CGTGAAGGCG
AAGCCCTTCC TGGAAGAGCT GAAAACGAAG TACCCGACCC TGCGCGTCGA GTCCCTCGAG
GTGCTGGAAA ACCGCGAGAA CCTTCCCCGG CTGATGGCGA TGGCCCGCGC CCGCCACAAG
GAGGCGACGG GGGTGCCGGT CTTCATCGTC GGCCAGGAGA TGTTCAGCGG CTTCTCTGCC
GAAACCGCGG CGGAGGTGGA ACAGGCGGTG CGCCTGGCCG TGCAGCCGGT CGCGCCGCAG
GAACAGGCCG CGAAACCCGC CCCCACCCCT TCCGTCAGGC TCCCGCTCCT GGGACCGGTC
GACGCGCAAA GCCTGTCGCT TCCCGTTTTC ACCGTCGCGG TCGCGCTTTT GGACAGCTTC
AACCCCTGCG CTTTCTTCGT GCTCTTCTTC CTGTTGAGCC TGTTGATCCA CGCCCATTCG
CGGCGCCGCA TGCTTCTCAT CGGCGGGCTC TTCGTCTTTT TCTCCGGGCT CGTCTACTTC
GTCTTCATGG CGGCCTGGCT CAACCTCTTC CTCATCACCG GCGGGCTTCC CGCCATCACC
TTCGCCGCGG GAATCGTCGC GCTCTTCGTC GGCGCGGTGA ACGTCAAGGA ATTCTTCTAT
TTCGGGCAGG GGGTCTCGCT CAGCATCCCG GAACAGCAGA AACCGAAGCT CTTCGCCCGC
ATGAGGAGGC TCCTCAGGGC GGATTCGCTC CCCTCCCTTT TGGCCGGGAC CACGGTGCTG
GCTCTCGCCG CCAACAGCTA CGAGCTCCTC TGCACCGCCG GCTTTCCCAT GGTCTTCACC
CGCATGCTCA CCCTGAGGGA ACTCTCCACC TACGGCTACT ACGCCTATCT TGCCTTCTAC
TGCACGATCT ACGCGCTCCC CCTGGCGGTC ATCGTCGCGA TCTTCACGGT GAAGCTCGGT
GAACGGAAGC TGACAATATG GCAGGGACGG GTGCTGAAGC TGGTCTCGGG ATTGATGATG
CTGGGGCTGG GGCTCGTGCT GCTCATCGAC CCGGCCCTGC TGAACAACCC GTTGGCCTCG
GCTGCGCTTC TCGGGGGGAC GCTGACCACG ACGGCGCTTT TGGCGGCTTT CGCCAGAAAA
AGGGGCGCGG GTTAG
 
Protein sequence
MPVTIFTSPI TPIRSGRDSR RMAAPPPSLA RSLRRLARFW LAACLALLLL VPPGAHAAPS 
ETSATLYFFW GVGCPHCVKA KPFLEELKTK YPTLRVESLE VLENRENLPR LMAMARARHK
EATGVPVFIV GQEMFSGFSA ETAAEVEQAV RLAVQPVAPQ EQAAKPAPTP SVRLPLLGPV
DAQSLSLPVF TVAVALLDSF NPCAFFVLFF LLSLLIHAHS RRRMLLIGGL FVFFSGLVYF
VFMAAWLNLF LITGGLPAIT FAAGIVALFV GAVNVKEFFY FGQGVSLSIP EQQKPKLFAR
MRRLLRADSL PSLLAGTTVL ALAANSYELL CTAGFPMVFT RMLTLRELST YGYYAYLAFY
CTIYALPLAV IVAIFTVKLG ERKLTIWQGR VLKLVSGLMM LGLGLVLLID PALLNNPLAS
AALLGGTLTT TALLAAFARK RGAG