Gene GM21_2221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2221 
Symbol 
ID8137558 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2590909 
End bp2592579 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content63% 
IMG OID644869835 
ProductMammalian cell entry related domain protein 
Protein accessionYP_003022029 
Protein GI253700840 
COG category[R] General function prediction only 
COG ID[COG3008] Paraquat-inducible protein B 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.00000538523 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACTGAGA CACCTGAAAA AAACGACCTC AACGACATCC CCGAGGCTGT CAGTGAGCCC 
AAGCGCCGCT TCAGCATCCA GCTGGTCTGG ATCATACCCA TCGTCGCCGC GCTGATCGGC
CTCTCCATCG CAGTCAAGGC TTACATCGAC CGCGGCCAGG CCATCACCAT CACCTTCAAG
ACGGGGGAGG GGCTGGAGGC GGGTAAGACC AAGCTCAAGT ACAAGGACGT GATGATCGGC
GAAGTGAAGT CGATCGCCAT CTCCAACGAC CGCTCCCACG TGGTGGTGAC TGCGGAGGTG
ACCAAGGACG CCCGCGGGCT CATGGTGAAG GACACCCGCT TCTGGGTGGT GCGCGCGCGG
ATTTCCGGCG GCAACGTTTC CGGCTTGAAC ACGCTCCTTG GTGGATCCTA CATCGGGGTC
GAGGCCGGAA GCTCGACCGA GGCGCGAGAA GAGTTCATCG GCCTCGAATC GCCTCCCGCC
GTATCCGTCG ACGTCCCCGG ACGCCAGTTC GTCCTGCACT CCGCCGAGGT CGGTTCGCTC
GATACAGGCT CCCCCATTTT CTTCCGCCGC ATGCAGGTGG GGCAGGTGAT AGGCACCGAA
TTGGACCGCG ACGGCAAAGG GGTGACGGTC AAAATCTTCA TCCGCTCCCC TTACGACAAA
TTCATCAAGG TCAACACCTA CTTCTGGCAT GCCAGCGGCA TCGACCTGAC TCTCAGCGCC
AGCGGCGTCA AGGTCAACAC CGAGTCCATG GTCTCGATCC TCCTGGGGGG TATCTCCTTC
GAGGTGCCGG AAGGAAAAGA GGATGCCTCG CCGGCTCCGC CCAATACCAT ATTCTCCCTG
TACGCGACGA GGGACGACGC GGCGAAACAC TCGGCGGCCG TGGAGAAATT CGTGCTCGTC
TTCAAGGAAT CGGTGCGCGG ACTCACCGTG GGGGCGCCGG TCGACCTGCG GGGGGTGACG
GTAGGAGAGG TCACCAAGAT AAACGTGGCG CTCGACCGCA GGGGGGCCGA TTTCACCGTT
CCGGTCGAGA TTCAGTTCTA CCCGGATCAC CTGCTTGCCG GGGGTAACAG CCAGGAGGAC
GCGCCCGAAA CTGGCGACAG GACGCTGAGA AGGCAGCTGG ACGAAATGGT CGCCCACGGC
TTCCGTGCGC AGATCAAGAG CGCCAGTCTC CTGACCGGGC AGCTTTACGT GGCTCTCGAT
TTCGTGCCGG GAGCGCGTGC CGCGAAGATC AACTGGGGCG CCGACCCGCC GCGCTTCCCG
ACCGTCCCGG GGTCGATGGA GAAGCTGCAG AAGAACCTGA TCGAGATCGT GCAGAGGATC
GAGAAACTCC CCCTGGAGCA GATCGCCGGC GACGCGGGGA CCACCATACG CTCGCTCGAT
TCGACCTTGA AAAGCGCCGA CCAGTTGCTG AAGAACATGG ACCGTACGCT GGTCCCCGAG
GCACGGAGCG TCCTTGCCGA GTCGCGGCAG GCCATCGACG AAGTGAAGAA GACCCTTGCC
GAAGCGCGTC AGACACTCGG CGGCGCCAGC GGGGTTCTCG CCCCCGATGC CCCGGTGCAG
GTCGACCTGC GCGACACCAT GCGCGAGGTG TCGCGCGCTG CCCAGTCGCT CAGGGTTCTG
GGCGACTACC TGGAACAGCA CCCCGAAACG CTCATCCGCG GCAAGAAATA G
 
Protein sequence
MTETPEKNDL NDIPEAVSEP KRRFSIQLVW IIPIVAALIG LSIAVKAYID RGQAITITFK 
TGEGLEAGKT KLKYKDVMIG EVKSIAISND RSHVVVTAEV TKDARGLMVK DTRFWVVRAR
ISGGNVSGLN TLLGGSYIGV EAGSSTEARE EFIGLESPPA VSVDVPGRQF VLHSAEVGSL
DTGSPIFFRR MQVGQVIGTE LDRDGKGVTV KIFIRSPYDK FIKVNTYFWH ASGIDLTLSA
SGVKVNTESM VSILLGGISF EVPEGKEDAS PAPPNTIFSL YATRDDAAKH SAAVEKFVLV
FKESVRGLTV GAPVDLRGVT VGEVTKINVA LDRRGADFTV PVEIQFYPDH LLAGGNSQED
APETGDRTLR RQLDEMVAHG FRAQIKSASL LTGQLYVALD FVPGARAAKI NWGADPPRFP
TVPGSMEKLQ KNLIEIVQRI EKLPLEQIAG DAGTTIRSLD STLKSADQLL KNMDRTLVPE
ARSVLAESRQ AIDEVKKTLA EARQTLGGAS GVLAPDAPVQ VDLRDTMREV SRAAQSLRVL
GDYLEQHPET LIRGKK