Gene GM21_2054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2054 
Symbol 
ID8137390 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2380013 
End bp2381173 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content61% 
IMG OID644869669 
Productprotein of unknown function DUF214 
Protein accessionYP_003021864 
Protein GI253700675 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4591] ABC-type transport system, involved in lipoprotein release, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value4.01978e-17 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACTGC ACACGATATC CATCAACAAT CTGAAGCGCC GCAAGGCCAA GATGGCTTTC 
CTCACCATCG GCCTCATGGT CGGGATCGCC ACCATCGTCA CCCTGGTGAC CCTCACCAAC
TCCATGTCCA CCGATATCGA AAGAAAAATG GAGGAGTTCG GCGCCAACAT CCTGGTCACC
CCCCAGAGTA ACGGCCTCGC CATGAACTAC GGCGGCATAA GCCTGGGCGG GATCACCTTC
GACCAGCGCG AGATCAAGGA AGAAGACCTG GCCCAGATCC GCAAAATAAA GAACCAAAAG
AACATCGCGG TCATCTCGCC CAAGGTGCTG GGCGGGATCA AGGTCGGCAG CCAGGACGTG
CTGCTGGTCG GCGTCGACTT CGCCAGCGAA CTGAAGATGA AGCAGTGGTG GCAGATCTTC
GGCGACGCCC CGAAGGGAGA CAACGAGCTC CTTTTGGGAA GCGACGCCTC CAACGTCCTC
GATGCCGGCT CCGGCGACAG CATCCAGATC AAGGGGGAGA CCTTCAAGGT CGCCGGCGTC
CTGAACCAGA CCGGCTCGCA GGACGACTCG CTGGTCTTCG CCTCGCTCCC CAAGGCCCAA
AAGCTCCTGG GCAAGGAAGG GAAGATAACC ATGGCCGAGG TCGCCGCACA CTGCTCGGGC
TGCCCCATAG GGGACATGGT GACCCAGATC GCCGAGAAGC TCCCCGACAC CAAGGTCTCC
GCCATCCAGC AGGTGGTCGA GGGGCGGCTG AAGGCGCTGG ACCACTTCAA GCGCTTCTCC
TACGCCATGG CTGCGGTCGT CGTCTTCATC GGCTCGCTCA TCGTCTTCGT CACCATGATG
GGTAGCGTCA ACGAGCGCAC CACCGAGATC GGCGTGTTCC GCGCCATCGG TTTCCGCAAA
AGCCACATCA TGCGCATCAT CCTCCTGGAA GCCGCGCTGG TGAGCCTCCT GGCTGGGCTT
TTGGGTTACG CCGCCGGGAT GGGCGGGGCC AAGCTGGCGC TTCCCTTCAT GGCCGAAACG
AAAAACGCGC ATCTGGTCTG GGACAGCACC GTCGCCTTTG GTTCGGTGGG ACTTGCCGTA
CTGCTCGGCC TTCTGGCGAG CCTTTACCCC GCGCTTCACG CCAGCAAGAT GGATCCGACC
GAGGCCCTCA GGGCTCTTTA A
 
Protein sequence
MKLHTISINN LKRRKAKMAF LTIGLMVGIA TIVTLVTLTN SMSTDIERKM EEFGANILVT 
PQSNGLAMNY GGISLGGITF DQREIKEEDL AQIRKIKNQK NIAVISPKVL GGIKVGSQDV
LLVGVDFASE LKMKQWWQIF GDAPKGDNEL LLGSDASNVL DAGSGDSIQI KGETFKVAGV
LNQTGSQDDS LVFASLPKAQ KLLGKEGKIT MAEVAAHCSG CPIGDMVTQI AEKLPDTKVS
AIQQVVEGRL KALDHFKRFS YAMAAVVVFI GSLIVFVTMM GSVNERTTEI GVFRAIGFRK
SHIMRIILLE AALVSLLAGL LGYAAGMGGA KLALPFMAET KNAHLVWDST VAFGSVGLAV
LLGLLASLYP ALHASKMDPT EALRAL