Gene GM21_0988 details


Gene Information

Locus tag: GM21_0988
Symbol:
ID: 8136309
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1167010
End bp: 1168176
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 63%
IMG OID: 644868602
Product: protein of unknown function DUF214
Protein accession: YP_003020811
Protein GI: 253699622
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG4591] ABC-type transport system, involved in lipoprotein release, permease component
TIGRFAM ID:
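
The coordinate and length fields in this record can be cross-checked with a few lines of Python. This is only a sketch: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one terminal stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the record; the variable names are illustrative.
start_bp = 1167010
end_bp = 1168176

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the coding sequence ends with a single stop codon,
# which is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the script prints 1167 388, which agrees with the Gene Length and Protein Length fields.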


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 82
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAATCC CTTACTCCTA CAGTTTCCGC AACCTTTGGA CCCGGCGGCT CACCACGCTT 
CTCACCGCGA GCGGGATGGG TCTCGTCGTC TTCGTCTTCG CCGCCACCCT CATGCTCACC
GAGGGGTTGC AAAAGACCCT GGTGCAGACA GGCTCTCCCG ACAACGTGGT GCTGCTTCGG
AAGGCCGCCG GTTCCGAGGT GCAAAGCGGC GTGGAGCGCT CCCAGGCGGC CCTTCTGGAG
AGCCAGCCCG AAGTCGCCAT CGGCGCCGAC GGAGAGCCGC TTTTAGCCAA GGAAGTAGTG
GTGCTGATCA ACCTGAAAAA GCGGGTGGGG GACAAGCCCA GCAACGTGGT GATCAGGGGG
GTGACGCCGA CCTCGCTCAG GCTGCGTCCC GCCATCCGGC TAAAGGAGGG GCGCATGCCG
CGGCCAGGTT CCGCCGAGGT GATCGCAGGC GAGAGCATCG CCCGGCGCTT CAAGGGGGGG
GGGATGGGTG AGACCATCCG GTTCGGGATG CGGGACTGGC GGGTGGTAGG CGTCTTCGAT
GCCGGTTCCA CCGGATTCTC CTCCGAGATC TGGGGGGACG CCGACCAGTT GATGCAGGCC
TTTCGCAGAC AGGCCTACTC CTCCATCATC TTCCGGTTGC GGGACTCGAC CAGGTTCGAC
TCCTACAAGG CGCGGGTGGA GAGCGACCCG AGGCTCACCG TCGAGGCCAA GCGGGAGACC
CAGTATTACC TGGACCAGTC CGAGGCCATG TCCAAGTTCC TCAACATCCT CGGGATGGTG
TTGACCGTGG TCTTTTCCAT AGGCGCCGTG ATCGGGGCGA CCATAACCAT GTACGCCGCC
GTCGCCAACC GCGTCACCGA GATCGGGACC CTGCGCGCGT TAGGGTTCCA GAGGAAGAGC
ATCCTCTCCG CCTTCATCGT CGAGGCTCTT TTTTTGGGGC TTTGCGGCGG TGGGCTGGGG
ATCTTCGCCG CGAGCTTCAT GCAGCTCATC ACCATTTCCA CCATGAACTG GGCCTCCTTC
TCCGAGCTCG CCTTTTCCTT CACGCTCAAC TTTTCCATCG TTTGGAAGTC CTTGCTTTTT
TCCGCCGTAA TGGGGCTAGT GGGAGGCACG TTGCCCGCCT TCCGTGCCTC GCGGATGAAT
ATCGTGGAAG CCCTGAGGGC GACATAG
 
Protein sequence
MGIPYSYSFR NLWTRRLTTL LTASGMGLVV FVFAATLMLT EGLQKTLVQT GSPDNVVLLR 
KAAGSEVQSG VERSQAALLE SQPEVAIGAD GEPLLAKEVV VLINLKKRVG DKPSNVVIRG
VTPTSLRLRP AIRLKEGRMP RPGSAEVIAG ESIARRFKGG GMGETIRFGM RDWRVVGVFD
AGSTGFSSEI WGDADQLMQA FRRQAYSSII FRLRDSTRFD SYKARVESDP RLTVEAKRET
QYYLDQSEAM SKFLNILGMV LTVVFSIGAV IGATITMYAA VANRVTEIGT LRALGFQRKS
ILSAFIVEAL FLGLCGGGLG IFAASFMQLI TISTMNWASF SELAFSFTLN FSIVWKSLLF
SAVMGLVGGT LPAFRASRMN IVEALRAT
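
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a hedged sketch, not the pipeline that produced this record: it assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and the helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the amino acid string
# of the standard genetic code.
# Assumption: table 11 uses these same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to display blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGGGAATCC CTTACTCCTA CAGTTTCCGC"
print(translate(first_blocks))
```

For the three blocks shown, the script prints MGIPYSYSFR, the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.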