Gene GM21_3638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3638 
Symbol 
ID8139012 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4215788 
End bp4216912 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content58% 
IMG OID644871259 
Productprotein of unknown function DUF214 
Protein accessionYP_003023417 
Protein GI253702228 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4591] ABC-type transport system, involved in lipoprotein release, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones153 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATTGAAC GCCACCGCTA CATCCTCGAC TTCACGTTGA CCTCCTTTCT CCGCAGGAAA 
GGGAAGAACG CCGTCCTTCT CGTTGTCTAC ACCCTTGTAG TATTCGTAGT GGCTTCAGTG
CTGTTTTTCA CTCATGCCTT ACGGTACGAG GCGTCGCTGC TGCTCAAAGA CGCGCCTGAT
ATCGTGGTCC AAAATACCCT GGCCGGACGT CAGCATCCTG TCCCAATTGA ATGGCGCCGG
TCCATCGGCG CCATCCGCGG CGTAGCCTCT GCCGCACCTA GGCTCTGGGG GTACCACTAC
GATGAAGCGT TTGCAGCGAA CTACACCCTC CTGGTACCCG TCAAAGACGA GCCGCCGTCG
GGAAGTATGG ATATCGGCAG CGCCATCTCC CGCACCCGCA ACGCCTATCC CGGCGACATC
ATCTCCCTCC CCGGACGCGA TGGCCGCCCC CGCGCCTTTA CCGTTCGGCG GGCCCTGACC
TCGGATTCGC AGCTCCTGAC CGGAGACCTC ATGGTCCTTT CGGAGAAAGA CTTCAGGGAG
CTTTTCGGTA TCCCGAAAGA TCAGGCGACA GATCTCGTGT TGCGGGTCCC CAATGCCCGG
GAGCAGCGCA CCGTAGCGAA GAAGATCACC CGGCTTTATC CTGAGGCGCG CCCCATCTTG
CGCGAGGAGA TGCGACGGAC CTATGACGCC GTTTATGGCT GGCGTTCCTC GCTTCTCTTG
GTCGTCTTCA GCGGAGCAGG TTTGGCCTTC TTTATCTTCG CATGGGATAA GGCTACGGGC
ATCTCCGCCG AGGAAAGAAA AGAAATCGGC ATCCTCAAGG CTATCGGGTG GGAGACTTCA
GACATACTGT TGATGAAGTT CTGGGAGGGA ATCGTGATTT CGCTTTGTTC TTTCCTGGCG
GGAAGCATCC TTGCCTACTT CCACGTCTTT GTCTCTTCCT CTGCACTTTT TCTTCCCGTC
CTTAAAGGAT GGTCGACCCT CTATCCCACC TTCAGGCTTC AGCCATCCAT CGATCACTGG
CAGCTTGCGG TCCTCTTCTT CTTGACGGTG GTTCCATACA CCATCGCTAC CATTATCCCT
TCCTGGCGCG CGGCAACGAC CGATCCCGAT GTGGTGATGA GGTGA
 
Protein sequence
MIERHRYILD FTLTSFLRRK GKNAVLLVVY TLVVFVVASV LFFTHALRYE ASLLLKDAPD 
IVVQNTLAGR QHPVPIEWRR SIGAIRGVAS AAPRLWGYHY DEAFAANYTL LVPVKDEPPS
GSMDIGSAIS RTRNAYPGDI ISLPGRDGRP RAFTVRRALT SDSQLLTGDL MVLSEKDFRE
LFGIPKDQAT DLVLRVPNAR EQRTVAKKIT RLYPEARPIL REEMRRTYDA VYGWRSSLLL
VVFSGAGLAF FIFAWDKATG ISAEERKEIG ILKAIGWETS DILLMKFWEG IVISLCSFLA
GSILAYFHVF VSSSALFLPV LKGWSTLYPT FRLQPSIDHW QLAVLFFLTV VPYTIATIIP
SWRAATTDPD VVMR