Gene GM21_2018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2018 
Symbol 
ID8137352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2340421 
End bp2342964 
Gene Length2544 bp 
Protein Length847 aa 
Translation table11 
GC content67% 
IMG OID644869631 
Productprotein of unknown function DUF214 
Protein accessionYP_003021828 
Protein GI253700639 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4591] ABC-type transport system, involved in lipoprotein release, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.0000000000691729 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATCCTTT GGCGGGCCGG GGTGCGCAAC GTGCTTCGTC ATCCCTGGCT CACGGTGCTG 
GCGGTCTTGG GGGTCGCCCT CGGCGTCGCC GTGGTCACAG CCGTCGACCA CGCCAACCAG
GCCGCCCAGC GCGCCTTCCA GATCGCCGCT GAGACCGTGG CCGGGCGCGC CACGCATCAA
ATTGTCGGCG GACCTTCCGG GATTCCCGAA CAGCTTTACC GCACCATCAA GGTCGACCTC
CGTTTCCGCG CCGCTGCTCC CGTGGTGACC GGGAACCTGC AGCTCGCAGG GCAGCCGGGG
AGGACCTTCC AACTCATCGG CGTAGATCCC TTCTCCGAAT CCCCCATCCG CTCCTTCAGC
TCCCGTTTCA CCGACAAGGA GATCCTGCCC AGGCTCATGG GGCATCCCGA CACGGTCCTC
ATGCTGCGTC GGACCGCCAC CGAGCTTTCG CTCTCGCGCG GACAGAGCTT CGCCGTGGAC
GTCGCCGGGG TTAGGCGCCG ACTGATCCTG GCGGGACATA TCGAGCCGCC GGACGAGGTG
AGCGGCGTCG CCCTTGTTTC CGTCCTGGTG AGCGACATCG GCACCGCGCA GGAGATCCTC
AACCGTACCG GACGCCTGTC GCGCATCGAC CTGGTCGTGC CCGAGGGTAG GGAAGGTGAG
ACCGTGCTGC GGCGGTTGGG CGCCGCGCTC CCCCCCGAGG CCTCTATCGT CCCCGCCGGC
GCCCGCGCGG GCGCGATGGA GAAGATGACC CGCGCCTTCC GCCTCAACCT CACCGCCTTG
AGCCTTTTGG CCCTTGTTGT CGGCATGTTC CTCATCTACA ACACCATGAC CTTTTCGGTG
ATCCGCAGAA GACGCCTGAT CGGCATGCTG CGCGCCCTGG GCGTGAGCCG CAGGGAGATC
TTCCTGATGA TCTGCGCCGA GGCCCTCTTG ATCGGCGCGG CAGGCACCGT CGCCGGGCTT
CTCTGCGGAG AGCTCCTGGG GAGCGAACTG ACGCGCCTGG TAACCAGGAC CATCAACGAC
CTGTACTTCG TGATGGAGGT GCGCAAGGTT CCTCTGCTCC CCTTGGCGCT TTGGAAGGGG
GCTCTGCTAG GCGTCGGAGC GACGCTCGTC GCGGCCTTCC CCGCCGCCCT GGAGGCGACC
AGCGCGCCGC CACGGGCGGT CATGTCCCGC TCGCTCATCG AGGCGCGCCA CAGGAAGCTG
GTGCCGCTGG CGACATCGGC CGGGGTGCTG CTGATGCTGA TCGGTTGCGG GCTTTTTCTT
TATCAGCAAG GGGGCGTGGC GGGAGGTTTC GTCGGGCTCT TCGCCGTCAT CGTCGGCTAT
ACACTCCTGG TTCCCGCGGC CGTGCTCGCT TTTGCCCGGG CGCTCGCCCC GGCGATGGGA
AAGCTCGCCG GGTCTATAGG GAAGATGGCG GCACGCGGGG TGGCGGTCTC GCTGTCCCGT
ACCGGGGTGG CGACCGCCGC GCTGGTGGTG GCCGTCTCCG CAGGAATCGG GGTCGGGATC
ATGGTGGGGG GCTTTCGCCT GACCGTCCAG AACTGGCTCG CCAACTGGCT GCAGGCCGAC
GTCTACGTCA CCTCCGCCGA CAAAAGCGGC GGGCGCTACC GGCCTCCGCT TGATCCTGCC
CTGGTGCGGC GCCTCTCCTT ATTGCCGGGG ACAGCCGGGT TCACCCTTTC CCGGCGGGTC
TCACTGGAAG GGGCCGGTGG AGCGACCGAA CTCTTTTCGG TCTCGGTGCC GGAGGCGACC
TTCGCCCGCT ATCCCTTCAA GGAGGGGAAT CCCAAGGTCG CCTGGAAGAG CTTCGACGAG
GGGAAATCCG TGCTGGTCTC CGAGCCGTAC AGCTACCGCT ACCGGGTGCG CATCGGAGAC
CGGGTGGCGC TCCGCACCCG GTACGGGATG AAGCAGTTCC CCGTATCCGC CATCATCTAC
GACTACGGCA CCGATACCGG CATCGTGATC ATGAGCCGCA GGGGGTACCT GGAAAACTTC
AACGACCCTT CCGTCGACGG GATGTCCTTC ACCGCGGGAC CGGGGCAGAG CGTCGCGGCA
CTGATGCGAC AGATCCGCAG CAGCGCGGGG GAAGAGACGA TCAACGTCAT CTCCAACGCG
GAACTGCGCC GGGCGACGGT CGAGATCTTC GACCGTACCT TCGCCATCAC CTCTGTGCTG
AGGATACTCA CCATGCTGGT CGCCTTCGTC GGGATCCTCT CCGCCCTCAT GGCCATGCAG
GTGGAGCGCG CGAGGGAGCT GGCCGTGCTG CGGGCGGTCG GGCTCACGCC GGGTCAGGTC
TGGGGGGTGG TCTGCGGCGA GACCTTCCTG ATCGGACTCA TCGGCGGGGC GCTGTCGCTT
CCGCTGGGCA TCCTGGAGGC GCTGGTCCTC ATTTACGTGG TGAACCTGCG CTCCTTCGGC
TGGACCATGC AGCTCTCCAT CGAGCCGGCC TACCTGCTGC AGGCCCTGCT CCTTTCGGTG
GGGGCGGCGC TTTTGGCCGG CATCTACCCG TCGCTCAGGA TCGCCCGTAC CTCCCCGGCC
CTGGCCCTGA AGGAGGAGGA TTAG
 
Protein sequence
MILWRAGVRN VLRHPWLTVL AVLGVALGVA VVTAVDHANQ AAQRAFQIAA ETVAGRATHQ 
IVGGPSGIPE QLYRTIKVDL RFRAAAPVVT GNLQLAGQPG RTFQLIGVDP FSESPIRSFS
SRFTDKEILP RLMGHPDTVL MLRRTATELS LSRGQSFAVD VAGVRRRLIL AGHIEPPDEV
SGVALVSVLV SDIGTAQEIL NRTGRLSRID LVVPEGREGE TVLRRLGAAL PPEASIVPAG
ARAGAMEKMT RAFRLNLTAL SLLALVVGMF LIYNTMTFSV IRRRRLIGML RALGVSRREI
FLMICAEALL IGAAGTVAGL LCGELLGSEL TRLVTRTIND LYFVMEVRKV PLLPLALWKG
ALLGVGATLV AAFPAALEAT SAPPRAVMSR SLIEARHRKL VPLATSAGVL LMLIGCGLFL
YQQGGVAGGF VGLFAVIVGY TLLVPAAVLA FARALAPAMG KLAGSIGKMA ARGVAVSLSR
TGVATAALVV AVSAGIGVGI MVGGFRLTVQ NWLANWLQAD VYVTSADKSG GRYRPPLDPA
LVRRLSLLPG TAGFTLSRRV SLEGAGGATE LFSVSVPEAT FARYPFKEGN PKVAWKSFDE
GKSVLVSEPY SYRYRVRIGD RVALRTRYGM KQFPVSAIIY DYGTDTGIVI MSRRGYLENF
NDPSVDGMSF TAGPGQSVAA LMRQIRSSAG EETINVISNA ELRRATVEIF DRTFAITSVL
RILTMLVAFV GILSALMAMQ VERARELAVL RAVGLTPGQV WGVVCGETFL IGLIGGALSL
PLGILEALVL IYVVNLRSFG WTMQLSIEPA YLLQALLLSV GAALLAGIYP SLRIARTSPA
LALKEED