Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2174 |
Symbol | |
ID | 8137510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 2542514 |
End bp | 2543746 |
Gene Length | 1233 bp |
Protein Length | 410 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644869789 |
Product | aromatic hydrocarbon degradation membrane protein |
Protein accession | YP_003021984 |
Protein GI | 253700795 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG2067] Long-chain fatty acid transport protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 125 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCAAA AGAAAGGCCT GCTGGTCGCG TTGTTGACAC CCATACTTGC TTTCAGCGGC GCAGCGGCGG TCCATGCTTC CGGTTACGCA GTATTCACGC ACGGCGCCTC TGCGTTGGGG CAAGGCAACG CCGTGACAGC CCATAGCAGC GATCCCAGCA CCATTTTCTA CAACCCCGCC CTTATGAACA AGCTGGAGGG GACCCAGGTC CAGGTAGGGA CCACCGGCGT GTTCTCTTCC CGCCAGTACG AGCCCGCAGG CGCCGGCAAC GGAACCTCCA GCGACTCAGC CTTCTTTCCG AGCCACTTCT ATGCGACCCA CAAGCTGAAC AACCAGGTGA GCGTCGGCCT GGGCATCTTC AACCCCTTCG GTCTGGGGAC CGAGTGGGGC GAGCAGTGGG ACGGGCGCTA TATCGCCACC AAGTCGACGC TGAAGACATT CAACATCAAC CCTGCCGTTT CGGTTCAGGT CACCCCCAAG CTGGCTCTCG CCGCCGGGGT CGACGTAGTC CTGCTCGACG CGAGCCTGGA GAAGAAGGTC CCCTCCGGCG CGCTCGGCCT GACTCCCGAC TTCGATGTGA ACCAGAAGTT CAAAGGTGAC GGCAAGGGAG TAGGTTTCAA CGTCGGGGTA GCATACGACG TCTGCGACCA CTTCTCTCTT GGCGCCTCCT ACCGCAGCGA GGTGAAGATT GACGTGTCGG GCGATGCCGC ACTTACCGCT GGCGGGGCTT CGATTGCGTC CTTCGGGGGG AATACGGACC TGACCCTGCC GCCGCAGTTC ACCGCAAGCG CGGCCTTCAA AGGGATCGAC AAGCTAGTGC TCGAAGCGGG AGTGCGCTGG GAGGGATGGT CCAAGTTCAA GGAACTAGCG ATCCATGTCG ACAACGGCCA GCCTGCCGTC ACCCCTAGAA ACTGGAAGGA CAGTTGGGGG GGGAATGCAG GGGGCAGGTA CCAGATGAAC GACACCGTCG CGCTTCTGGC CGGCTACGTA TACGGCGACA CCCCCGTTCC CGACAGCACC TTCGACCCGT CGATACCCGA TGCCAAGACA CATGTTTTCT GCGCCGGCAC CGATTTGAAC TTCAATCCGG TCACCGTGGC GTTTTCCTAC GGCTACCAGC TGCTGGAAAA CAGGCACAAG GATAACGGGA TACCGGCCGG AGCCGCCCTG GCAAACGGCA AATACAAGAC CGACGCGCAC CTGGTAGCGC TCTCCGTGGG ATACAAGTTC TAA
|
Protein sequence | MNQKKGLLVA LLTPILAFSG AAAVHASGYA VFTHGASALG QGNAVTAHSS DPSTIFYNPA LMNKLEGTQV QVGTTGVFSS RQYEPAGAGN GTSSDSAFFP SHFYATHKLN NQVSVGLGIF NPFGLGTEWG EQWDGRYIAT KSTLKTFNIN PAVSVQVTPK LALAAGVDVV LLDASLEKKV PSGALGLTPD FDVNQKFKGD GKGVGFNVGV AYDVCDHFSL GASYRSEVKI DVSGDAALTA GGASIASFGG NTDLTLPPQF TASAAFKGID KLVLEAGVRW EGWSKFKELA IHVDNGQPAV TPRNWKDSWG GNAGGRYQMN DTVALLAGYV YGDTPVPDST FDPSIPDAKT HVFCAGTDLN FNPVTVAFSY GYQLLENRHK DNGIPAGAAL ANGKYKTDAH LVALSVGYKF
|
| |