Gene GM21_0989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0989 
Symbol 
ID8136310 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1168381 
End bp1169538 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content63% 
IMG OID644868603 
Productprotein of unknown function DUF214 
Protein accessionYP_003020812 
Protein GI253699623 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4591] ABC-type transport system, involved in lipoprotein release, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones87 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCATGC TGAAATACAT CATACGGAAC CTCTTCCGGC ACAAACTCCG CTCGGTGCTT 
ACCGTGGTGG GCGTCGCCGT CGCGGTCCTC GCCTTCGGAC TTTTGCGCAC CCTGGTCGGG
CTTTGGTACG CCGGCGCCGA GCACGCCTCG GACACGAGGC TCGTCACCCG CAACGCCATC
TCGCTCGTCT TCCCGCTCCC CATCTCCTAC CTGGACCGCA TCCGCGGCGT CTCCGGGGTG
AGCTCGGTCT CCTACGGCAA CTGGTTCGGG GGCGTCTACA TTGAGGAGAA GAACTTCTTC
GCCAACTACG CCGTCGAGCC GCGCACCTAC CTAGCCCTCT ACCCCGAACT GGTCCTCACG
GAGAAGCAGA AAAACGACTT CATCCTGGAC CGTAAGGGGT GCATCGTCGG AGAGCGCCTG
GCGAAGACCT ACGGCTGGAA GGTGGGGGAT CTCATCACCC TGAAAGGGAC CATCTTTCCC
GGCAACTGGG AGTTCGTGCT GCGCGGGATC TATCACGGCG CCGAGAAGGC GACCGAGGAG
CGGCTGCTCC TTTTCCACTG GAGCTACCTG AACGAGAGCG TGCGCCGGAG TTCCCCCGGC
AGGGCGGACC AGGTCGGGTT CTTCATGATT GGGGTGAAGC GCCCCGAGCT GGCCCCCGAG
GTTTCCCTTG CCGTCGACTC CATGTTCAAG AACTCCCTGG CCGAGACCCT CACCGAGACC
GAGAAGGCTT TCCACATGGG ATTCATCGCC ATGACCGAGG CGATCATGGT GGCGATCCAG
ATCGTGTCCT ACATGGTCAT CGCCATCATC ATGGTGGTCG CGGCCAACAC CATGGCGATG
ACGGCGCGCG AGAGGATCGG CGAGTACGCG ACCCTGAAGA CGCTGGGGTT CAAGGCGTGG
CACCTGGCAG GGCTCATCTT CGGCGAGTCC GTCGCCATCT CCGTTTTGGG GGGCGTCCTG
GGAGTGGCGG CAACATTCCC GGTCGCCCAC TGGATCGAGG TCGAGTTAGC GCAGTACTTT
CCTTTTTTCA GCGTCTCGAT GGAGACCCTG CTTCTGGAGT TACTGGCCGC CCTTTCCGTC
GGAGTCGTCT CCGGGATCTT TCCCACCTGG CGCGGCGCCA CCATCCGCAT CGCGCAAGGG
CTGAAGCGCA TAGGCTAA
 
Protein sequence
MFMLKYIIRN LFRHKLRSVL TVVGVAVAVL AFGLLRTLVG LWYAGAEHAS DTRLVTRNAI 
SLVFPLPISY LDRIRGVSGV SSVSYGNWFG GVYIEEKNFF ANYAVEPRTY LALYPELVLT
EKQKNDFILD RKGCIVGERL AKTYGWKVGD LITLKGTIFP GNWEFVLRGI YHGAEKATEE
RLLLFHWSYL NESVRRSSPG RADQVGFFMI GVKRPELAPE VSLAVDSMFK NSLAETLTET
EKAFHMGFIA MTEAIMVAIQ IVSYMVIAII MVVAANTMAM TARERIGEYA TLKTLGFKAW
HLAGLIFGES VAISVLGGVL GVAATFPVAH WIEVELAQYF PFFSVSMETL LLELLAALSV
GVVSGIFPTW RGATIRIAQG LKRIG