Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2221 |
Symbol | |
ID | 8137558 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2590909 |
End bp | 2592579 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644869835 |
Product | Mammalian cell entry related domain protein |
Protein accession | YP_003022029 |
Protein GI | 253700840 |
COG category | [R] General function prediction only |
COG ID | [COG3008] Paraquat-inducible protein B |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.00000538523 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACTGAGA CACCTGAAAA AAACGACCTC AACGACATCC CCGAGGCTGT CAGTGAGCCC AAGCGCCGCT TCAGCATCCA GCTGGTCTGG ATCATACCCA TCGTCGCCGC GCTGATCGGC CTCTCCATCG CAGTCAAGGC TTACATCGAC CGCGGCCAGG CCATCACCAT CACCTTCAAG ACGGGGGAGG GGCTGGAGGC GGGTAAGACC AAGCTCAAGT ACAAGGACGT GATGATCGGC GAAGTGAAGT CGATCGCCAT CTCCAACGAC CGCTCCCACG TGGTGGTGAC TGCGGAGGTG ACCAAGGACG CCCGCGGGCT CATGGTGAAG GACACCCGCT TCTGGGTGGT GCGCGCGCGG ATTTCCGGCG GCAACGTTTC CGGCTTGAAC ACGCTCCTTG GTGGATCCTA CATCGGGGTC GAGGCCGGAA GCTCGACCGA GGCGCGAGAA GAGTTCATCG GCCTCGAATC GCCTCCCGCC GTATCCGTCG ACGTCCCCGG ACGCCAGTTC GTCCTGCACT CCGCCGAGGT CGGTTCGCTC GATACAGGCT CCCCCATTTT CTTCCGCCGC ATGCAGGTGG GGCAGGTGAT AGGCACCGAA TTGGACCGCG ACGGCAAAGG GGTGACGGTC AAAATCTTCA TCCGCTCCCC TTACGACAAA TTCATCAAGG TCAACACCTA CTTCTGGCAT GCCAGCGGCA TCGACCTGAC TCTCAGCGCC AGCGGCGTCA AGGTCAACAC CGAGTCCATG GTCTCGATCC TCCTGGGGGG TATCTCCTTC GAGGTGCCGG AAGGAAAAGA GGATGCCTCG CCGGCTCCGC CCAATACCAT ATTCTCCCTG TACGCGACGA GGGACGACGC GGCGAAACAC TCGGCGGCCG TGGAGAAATT CGTGCTCGTC TTCAAGGAAT CGGTGCGCGG ACTCACCGTG GGGGCGCCGG TCGACCTGCG GGGGGTGACG GTAGGAGAGG TCACCAAGAT AAACGTGGCG CTCGACCGCA GGGGGGCCGA TTTCACCGTT CCGGTCGAGA TTCAGTTCTA CCCGGATCAC CTGCTTGCCG GGGGTAACAG CCAGGAGGAC GCGCCCGAAA CTGGCGACAG GACGCTGAGA AGGCAGCTGG ACGAAATGGT CGCCCACGGC TTCCGTGCGC AGATCAAGAG CGCCAGTCTC CTGACCGGGC AGCTTTACGT GGCTCTCGAT TTCGTGCCGG GAGCGCGTGC CGCGAAGATC AACTGGGGCG CCGACCCGCC GCGCTTCCCG ACCGTCCCGG GGTCGATGGA GAAGCTGCAG AAGAACCTGA TCGAGATCGT GCAGAGGATC GAGAAACTCC CCCTGGAGCA GATCGCCGGC GACGCGGGGA CCACCATACG CTCGCTCGAT TCGACCTTGA AAAGCGCCGA CCAGTTGCTG AAGAACATGG ACCGTACGCT GGTCCCCGAG GCACGGAGCG TCCTTGCCGA GTCGCGGCAG GCCATCGACG AAGTGAAGAA GACCCTTGCC GAAGCGCGTC AGACACTCGG CGGCGCCAGC GGGGTTCTCG CCCCCGATGC CCCGGTGCAG GTCGACCTGC GCGACACCAT GCGCGAGGTG TCGCGCGCTG CCCAGTCGCT CAGGGTTCTG GGCGACTACC TGGAACAGCA CCCCGAAACG CTCATCCGCG GCAAGAAATA G
|
Protein sequence | MTETPEKNDL NDIPEAVSEP KRRFSIQLVW IIPIVAALIG LSIAVKAYID RGQAITITFK TGEGLEAGKT KLKYKDVMIG EVKSIAISND RSHVVVTAEV TKDARGLMVK DTRFWVVRAR ISGGNVSGLN TLLGGSYIGV EAGSSTEARE EFIGLESPPA VSVDVPGRQF VLHSAEVGSL DTGSPIFFRR MQVGQVIGTE LDRDGKGVTV KIFIRSPYDK FIKVNTYFWH ASGIDLTLSA SGVKVNTESM VSILLGGISF EVPEGKEDAS PAPPNTIFSL YATRDDAAKH SAAVEKFVLV FKESVRGLTV GAPVDLRGVT VGEVTKINVA LDRRGADFTV PVEIQFYPDH LLAGGNSQED APETGDRTLR RQLDEMVAHG FRAQIKSASL LTGQLYVALD FVPGARAAKI NWGADPPRFP TVPGSMEKLQ KNLIEIVQRI EKLPLEQIAG DAGTTIRSLD STLKSADQLL KNMDRTLVPE ARSVLAESRQ AIDEVKKTLA EARQTLGGAS GVLAPDAPVQ VDLRDTMREV SRAAQSLRVL GDYLEQHPET LIRGKK
|
| |