Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3370 |
Symbol | |
ID | 8138737 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 3902392 |
End bp | 3903768 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644870988 |
Product | aromatic hydrocarbon degradation membrane protein |
Protein accession | YP_003023153 |
Protein GI | 253701964 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG2067] Long-chain fatty acid transport protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 87 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATAAAAT CGGTGTGCTG TGCGGGCGTG GCCTGCGCCT TCTTGGGGGG GGCCGGCATC TGCCACGCGG CGGGGTTCAA GGTGAGCGAG CAGGGGGCCA AGGCGATGGC TATGGGAAAC GCCTTCGCGG CACAGGCGGA CGACCCGAGC GCCCTGTACT TCAACCCCGC CGGCATCTCG TTTCTGCGCG GCGCGCAGGC GAACCTCGGC TCGTTGGCCA TACTGGTGCC CCAGACCGAG TTTCACGGCA CCACGCCACT GAGCGGCACC CCTCCCCTGG ACATCGGGAC TGCCCATGTG ACGGATAAAT CCAGAAGGGA CATCGTGGTA GCCCCGACCC TCTACGCCAC CTACAGCATG GAGACGCTCC CGCTTTCCTT CGGCCTGGGC GTCAACGCGG TGTACCCACT GACCAAGAGC TGGGACGACT CCAGCGTCTT CAGGAACCAG GTGCAGACCG CCTCGATCAA ACCGGTCAAC TTTCAGCCGA CGGTGGCGTA CCGTTTCGAC GACCTGAAGC TGGCGGTAGC CGGGGCTCTC GATGTCACCT ACGCCGTGGT CTCGCTGCAG AAGACGGCTT ATGCCCCCGC CATAGATCCC TCCGCGCCGG CTCCCCCCTT CGGCGCCTAC GAGCTCGGAT CGCTGGGGCT GGACGGGACG GCTACAGGCG TTGGGTACAA CTTCGGGATC CTCTGGAAAC CGCGGCCGCA GTACAGCTTC GGCGTGGCCT ACCGGAGCCG GATCACCCTC GACGTCAACG GCGACGCCAA TTTCCTCGCC ACCACCCCCA CCGGCCTTGG GGCCACCGGC CTTGGGGCCA TCGGCCTCTC GGAGGCCTCC CCCTTCCCCT ACACCAGGGC CCGCGCCGCC AGCGCCGCAT CGACCCGGAT CGTCCTACCA GACACCCTGG ACGTGGGCAT TGCCTGGCGC CCCACGGAAA AACTTACTTT AGAGTTCGAC GCCACCCGGA CCGGCTGGAG CAGCTTCGAC CAGTTGCTGA TCGAGTTCGA CTCCCCTGGG TTCGCGTCCT TCAACAACCG GCCGGACCCC AGGAACTGGC GCGACGTCTG GGCCTACAAG TTCGGCGGAC AATACTCCCT GAACGACACC CTCGACCTGC GCGCCGGCTA CTCCTTCGAC AACACCCCCG TTCCCGATGC CACCCTCGAT CCGCTGCTCC CCGACGCCGA CCGCCACAGC TTCGCCGTCG GCGCCGGCAT TCACCACAGC TTCGGTATCC TCGACCTCGC CTACATGTGG GTGCACTTCG TGGACCGCAC GGTCGACAAC CAGGACATGG CGGCGCTGCG CGGGTCCAAC GGCACCTTCA AAAGCGACGC CTACCTGTTG GCCGCGAACC TGAACTTTAA ATTCTGA
|
Protein sequence | MIKSVCCAGV ACAFLGGAGI CHAAGFKVSE QGAKAMAMGN AFAAQADDPS ALYFNPAGIS FLRGAQANLG SLAILVPQTE FHGTTPLSGT PPLDIGTAHV TDKSRRDIVV APTLYATYSM ETLPLSFGLG VNAVYPLTKS WDDSSVFRNQ VQTASIKPVN FQPTVAYRFD DLKLAVAGAL DVTYAVVSLQ KTAYAPAIDP SAPAPPFGAY ELGSLGLDGT ATGVGYNFGI LWKPRPQYSF GVAYRSRITL DVNGDANFLA TTPTGLGATG LGAIGLSEAS PFPYTRARAA SAASTRIVLP DTLDVGIAWR PTEKLTLEFD ATRTGWSSFD QLLIEFDSPG FASFNNRPDP RNWRDVWAYK FGGQYSLNDT LDLRAGYSFD NTPVPDATLD PLLPDADRHS FAVGAGIHHS FGILDLAYMW VHFVDRTVDN QDMAALRGSN GTFKSDAYLL AANLNFKF
|
| |