Gene Information |
Locus tag | GM21_3503 |
Symbol | |
ID | 8138875 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4042222 |
End bp | 4044681 |
Gene Length | 2460 bp |
Protein Length | 819 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644871122 |
Product | hypothetical protein |
Protein accession | YP_003023282 |
Protein GI | 253702093 |
COG category | |
COG ID | |
TIGRFAM ID | |
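The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the listed gene length), and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (GM21_3503 on NC_012918).
length = gene_length(4042222, 4044681)
print(length)                  # 2460
print(protein_length(length))  # 819
```

Both results agree with the "Gene Length" and "Protein Length" fields listed in the record.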
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 1.15934e-29 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGACCGACC GCAAAAAAGA CCTGCTGATA CTGTCCGGCT TACTGGCAGT CCTCTTGATA CTTTTCTCCA AAATACTCTT CACCTCGCAG ATCATCCGGG CTCCGGACAT CATCAACGAG TTCTACTGGG GAATTAAGGA TTTCGGCAAG CAGCCGTTAC TCTCCATGTT CAGGGTCGAC TTCTCCAGCG CGGGATGGAG CCCCTTGCTG AACTCCGGGT TCACCAACGA GGGGGGGATG GCTTCGGAGC AGTTACTCGT CATTCGCAAC CTCATCTTCT GGGCAATCCC CGCACCCGCC AGCGTGGCGT GGTACATAGT GGCGCAGCTT TTCTTCGGCG CCGCCGGCGC CTATTGCCTG TGCCGGCTCA TCGGCGCCGG CAGGCCCGCC TCCTTCCTGG CCGGTCTCGT CTTCGCACTC GCTCCTGAAA ACGCTTCGCT GATCAACGCC GGCCACGTGA TGAAGATCGC CACCATCACT TTCGCGCCCT GGGCCTTTTA CTTCCTTGAG AAAGGGTTCA AGACCAGGCG CTTGATCTTC TTCCTGACCA CCGCCGTGGT GCTTGCTTTC CAGTTCTTCC ACACCCATTG GCAGATCGCT TACTACACCT GCCTCGCCAT CGGCGTCTAC GGCATCGCTC GTTCGCTCGT CATCGTAATG GGGGCGCCGG AGGGGAAGAA GGAGTTCGGC CGCGTGCTCG GACTGAACGT GGCGCTCCTC GTATTCTTCC TCACCACGGT CGCCATCTCC CTTGTCCCGC TGGCCAACTG GTCCAAGGAC ACCAACCGCG GCGTGAACAG CGGCGCCAAC GTGGTTGCGG AGGCCGGCAA GACGGAGGCA AAGGGTGGGC TTAACCGCGA GGAGGCGATG TCCTGGTCGA TGCCGCCGGA AGAGACCGCC GCCTTCATCA TCCCCGGGAT GTTCGGCTTC TCCCGCCAGG AAGCGGGGGA GAACCCGAAG AACATAGACG CTTACTACTG GGGGCGGATG AACTTCACCC AGACGGTGAG CTACATGGGG CTTCTCCCCT GGCTCCTCTT GCCGCTGCCT CTTATCTTCC GGCGCGACCG TTACACTTGG CTCGCGCTCT CGGCCGTGGT GGTGGGGATC TTCTTCTCCA TGGGCAAATA CACCCTTTTC TACAACCTTC TCTTCGACTA CTTCCCGGGG ATCAACCGCT TCAGGGTTCC GAAGATGATG ATGTTCATCC CTGTGCTGGG GCTCGGGGTG CTCTCCGCAC TGGGACTCGA CCTGCTGCTG GACCCGGTGG TCCGCGCCAC CCGCGCCTTC AAGCGCTACA TACTTGGCAT CGTCCTCTTG CCGGTGGCGC TCCTGGCGCT CTTGGGGACG GAGATCGCCG CCGGGCAGTT CTGGGTCAAC ACCTTCATCG ACATCCTTTC CCAGCCGACC CGCTACCAGT CCCAAAGCGA GCAGCTCGTG CTGGAGCGCT GGAACAACCT GGTCGCCGAA ACCGCCATAG CCGCAGGACT TGCCGCTCTT TTCGCAGCGG CGTTCGCCCT GTACCACCGC GGCAAGCTCG CCGCGAAATT TCTTCCCCTG GTGCTGATCG CGCTGTTCCT GCTGGACGTG GGGCGCGTCA ACTCCAAGTT CCTCTTCCTC GTGGAAGAGC CGCATAAGGC GACCGCCGTG AAACCGCCGG AGATCGCTTT CCTCGCCAAT CAGCCCAAGG AGTACCGCGC GCTTCCCATG GGCGGCGACC CTATGCCGTA CGTGGCTTCC GGGATTCCGG TGATGTTCAC CTCGAACCCG GTGCAACAGC GCCGTTGGAT GGAGTACCTG 
GACAACTTCA ACCTCCTCTC CAGCATGCCG GACATCCTCA ACGTGAGATA CCTGGTGGTG ACCAAGGACC AGTACCGGCA GGACCAGGCC GGCATGGGCA ACAAATACCG CCCCGTTTTC ACCACGCCTG ACGGCGGCAC CATCATCCTC GAAAACCAGA ACGTTCTTCC CAAGGCGTGG CTGGCGCCTG TCGCGTTGAA GGCGGCCTCG GCACAGGAGT CGCTCATGGC GCTGCAGAAT CCGGCGTTCA ACCCGAGGCT GATGGCGGTG GTTGAGTCGG AGCCCCCCAT CCCGTTGGCG CCCCCTACCG CCCAGATCAC CGCGACGCCG GGACAGGTGC GCGTGGTGCG CTATGAAGGG GAGCGGATCG ACCTGGATGC CTCGGTCGCC ATGAACTCCC TGCTGGTTCT GGGAGAGAAG TACTATCGGG GTTGGCGCGC CACGGTGGAC GGTAAGGTCG CCGAAATCTA CCCGGTGAAT CACGTGCTGC GCGGCATTTA CCTCACCCCG GGGATGCACA AGGTCGAGTT CGTCTTCGAT CCGCTCCCCT TCAAGATCGG CAAGTACCTG ACCCTGGTCT CCTTTGCCGT CTTCGCCGTC TTCCTCGGGC GCGAGGTCGT GCTCAGACGG AGGCAGCAGG CCAAGGGTGC TGAGTCATGA
|
Protein sequence | MTDRKKDLLI LSGLLAVLLI LFSKILFTSQ IIRAPDIINE FYWGIKDFGK QPLLSMFRVD FSSAGWSPLL NSGFTNEGGM ASEQLLVIRN LIFWAIPAPA SVAWYIVAQL FFGAAGAYCL CRLIGAGRPA SFLAGLVFAL APENASLINA GHVMKIATIT FAPWAFYFLE KGFKTRRLIF FLTTAVVLAF QFFHTHWQIA YYTCLAIGVY GIARSLVIVM GAPEGKKEFG RVLGLNVALL VFFLTTVAIS LVPLANWSKD TNRGVNSGAN VVAEAGKTEA KGGLNREEAM SWSMPPEETA AFIIPGMFGF SRQEAGENPK NIDAYYWGRM NFTQTVSYMG LLPWLLLPLP LIFRRDRYTW LALSAVVVGI FFSMGKYTLF YNLLFDYFPG INRFRVPKMM MFIPVLGLGV LSALGLDLLL DPVVRATRAF KRYILGIVLL PVALLALLGT EIAAGQFWVN TFIDILSQPT RYQSQSEQLV LERWNNLVAE TAIAAGLAAL FAAAFALYHR GKLAAKFLPL VLIALFLLDV GRVNSKFLFL VEEPHKATAV KPPEIAFLAN QPKEYRALPM GGDPMPYVAS GIPVMFTSNP VQQRRWMEYL DNFNLLSSMP DILNVRYLVV TKDQYRQDQA GMGNKYRPVF TTPDGGTIIL ENQNVLPKAW LAPVALKAAS AQESLMALQN PAFNPRLMAV VESEPPIPLA PPTAQITATP GQVRVVRYEG ERIDLDASVA MNSLLVLGEK YYRGWRATVD GKVAEIYPVN HVLRGIYLTP GMHKVEFVFD PLPFKIGKYL TLVSFAVFAV FLGREVVLRR RQQAKGAES
|
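The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline. It assumes the gene sequence is printed in coding orientation (it begins with ATG and ends with TGA) and that the sense-codon assignments of translation table 11 match the standard genetic code. The helper names `clean`, `gc_content` and `translate` are hypothetical. Only the first and last blocks of the sequence are used here; the full sequence can be pasted in the same way, since the spaces between 10-base blocks are stripped.

```python
from itertools import product

# Codon table laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines between 10-base blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


head = "ATGACCGACC GCAAAAAAGA CCTGCTGATA"  # first 30 bases of the gene sequence
tail = "GGTGCTGAGT CATGA"                  # last 15 bases, including the stop codon

print(translate(head))  # MTDRKKDLLI
print(translate(tail))  # GAES
print(round(gc_content(head), 3))
```

The two translated fragments match the start (`MTDRKKDLLI`) and end (`...GAES`) of the listed protein sequence. Running `gc_content` on the full gene sequence would allow a comparison with the "GC content" field.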