Gene GM21_3550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3550 
Symbol 
ID8138922 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4111489 
End bp4112715 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content67% 
IMG OID644871169 
Productprotein of unknown function DUF214 
Protein accessionYP_003023329 
Protein GI253702140 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4591] ABC-type transport system, involved in lipoprotein release, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones161 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACCTGC GCATTCTCAA AAACTCCATC CTCAAGCGCC CAAAGCCCGT GGCTTTGGTG 
CTCCTCTCCA TCGTGATGGG ATCCGCCGTC GCCACCGCCT TCCTGGGGAT CTCCGGAGAG
CTGTCGCACA AGATGGCGCT GGAGCTCAGA AGCTACGGCG CCAACATCGT CCTGGAGCCG
GCGGCCGGAG AGGCGGGTGC GCTCAACTCC GACGACCTTC CCAAAATCAA GACCATCTTC
TGGAAGCACA ACATCGTGGG TTTCGCCCCC TTTCTTTTCG GGCAGGTCGA CTTTTCCGCC
CCCGGCGGGC GGGAGCGTGG CGTACTGGCC GGCACCTGGT TCGGCAGGCC GTTGCAGGTC
GAGGGTGAGC CGGAGAGCAT CCAGGGGGTG AAGGTGACCG CTCCCTGGTG GGAGCTCACC
GGCCGCTGGC CCGAGACCCC GGACGAGGCC GTGGTGGGGG CATCGCTCGC CAAAAGGCTG
CGCCTTGCCG AGGGGTCGGA AATCGTGGCC GGCGCCGGCG GCACAAGCCG CAGCTTCAGG
GTGGTCGGTA TCGCCGCCAC CGGCGGCTTC GAGGAGGAGC AGCTCTTCGC CCCGCTGGCC
GAGGTGCAGG CCCTCCTCGG GAAGCCGGGG AAGCTCTCGC GCGTCCTGGT GAGCGCGCTC
ACCGTCCCGA TGGACGACTT CGGCAGGAAG GATCCCGCCT CCATGAGCAA GGACGAGTAC
GAGAAGTGGT ACTGCACCGC CTACGTCACC TCGGTGGCCA AGGGGGTGGA GGAGGCGATG
GCCGGCAGCC GTGCCAAGCC GATCTGGCAG ATCGCCGGCG CCGAGGGGGC GCTTCTTGAG
AAGCTGAACA GCCTGATGCT GCTTTTAACC GTGCTGGCGC TTTGCTCCGC CGCCATCGCC
GTCTCGTCGA GCCTCATGGC GTCCATGGCC GAGAGAAGCG GCGAGATCGC CCTGATGAAG
GCGATGGGCG CCGACCGCAT CCAGATCGCG TCCATCTTCC TGGGGGAGAC CATGGCGATA
GCGCTTCTGG GCGGGGTGAT CGGGTACTTC GCCGGGGACC GGCTCGCCGT CGTCGTCAGC
CGCGCCGTCT TCGATTCGGC CGTGGCCTCG CCGGTCTGGC TTTTCCCCAC CGCCCTCGGA
TCGTCCTTCC TGGTGGCGCT TCTGGGGAGC CTCGCGCCGC TGAAAAGGGC GCTTGCCGTG
GAGCCGGTAC GGGTCCTCAA GGGGTGA
 
Protein sequence
MHLRILKNSI LKRPKPVALV LLSIVMGSAV ATAFLGISGE LSHKMALELR SYGANIVLEP 
AAGEAGALNS DDLPKIKTIF WKHNIVGFAP FLFGQVDFSA PGGRERGVLA GTWFGRPLQV
EGEPESIQGV KVTAPWWELT GRWPETPDEA VVGASLAKRL RLAEGSEIVA GAGGTSRSFR
VVGIAATGGF EEEQLFAPLA EVQALLGKPG KLSRVLVSAL TVPMDDFGRK DPASMSKDEY
EKWYCTAYVT SVAKGVEEAM AGSRAKPIWQ IAGAEGALLE KLNSLMLLLT VLALCSAAIA
VSSSLMASMA ERSGEIALMK AMGADRIQIA SIFLGETMAI ALLGGVIGYF AGDRLAVVVS
RAVFDSAVAS PVWLFPTALG SSFLVALLGS LAPLKRALAV EPVRVLKG