Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3071 |
Symbol | |
ID | 8138421 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3559983 |
End bp | 3561122 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644870675 |
Product | hypothetical protein |
Protein accession | YP_003022857 |
Protein GI | 253701668 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.000000174945 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACCGAT ACCGCATACA TTGCCTGCCG GGCCTCCTGC TGGTGGGGGC ACTTTGCTTT GGCGCGCCCG CTTTGGCCGC GACGCCCACA GCCGTCGCTG CGCCGGGCGC CGCCGAGACG ATGCGACCAA GGACCGAAGC GGAGTTCACC CCCACTTTTC GCTACGGCGA CCGTGTGCTC ACCGAGGACA CCGTCTGGAG AGGGGTGGTG CTGGTGGAAG GGGCGGTGAC CGTGGCGCCT CAGGCGACCC TGACCGTCGA GCCCGGGACC GTGATTCGCT TCAGGGGGGA TGACGCCTCC GGCGCAGTGC TGGTGGTGCA GGGGAGGATG GCGGCTGCCG GAACAAAGGA ATCCCCCATC GTTTTCACCT CCAGTTTTGC CGTACCTGCC GCAGGGGACT GGCAGGGGGT GATGCTCCTG GGGAGCGAGA AGAGAAACGT CCTTGAGAAC TGCCGCATCG AGGCCGCGCA GACCGGACTT GAAGCTATTT TCTCCAACCT GACGCTGAAG AACGTGCGGG CCGAGCGGAG CAAGGCCGGG ATGAGGTTTC AGGACGCCCT GGTCGTGATG GAGGGAGGCG GGACCAGCGA TTGCGATACC GGCCTCAACT TCTCCGAGAG CGAGGCGACC TTGCGCAACC TGAACCTGAT CGGAAACCGC AAAGGGCTCG TCGCCCAGCG CAGTTCCATT TATCTGCAGG AGGGAAGCTT TTCCATGAAC GGCTCCGCCT TCTCGTGCGA CAGCTGCCGG GTCAGGCTGC AGGGGGGAGG GGTGTCGGAC AACGGCAGGG GAATCACCCT GTACGAGAGC GAAGGGTCGG TCACCGGCGT TGAGGTGGCG CGCAACAGCG ACTACGGCAT TTCGCTCGCC ACCTCCCGGA TAAGGATCAC CGGGAACCAG ATCACCGGCA ACGGCAACAG CGGCCTTTTG GTCTTCGATG CCTCTTCCGT CGCCTGGGAC AACGCCATCC ATGACAACGG CTACGACCTT TACAACGCCG GCAAGGAGGA GTTCCGGGCG CCGGGCAACT GGTGGGGGGC GGCCGGGCCG AAGATTTACG ACAACGGGGG AGCCGGGAAG GTCCTCTCCA CCCCGCGGCT CACAGCACCG CCTGAAGCAG GTTCTAAAGA TAAACCCTAA
|
Protein sequence | MNRYRIHCLP GLLLVGALCF GAPALAATPT AVAAPGAAET MRPRTEAEFT PTFRYGDRVL TEDTVWRGVV LVEGAVTVAP QATLTVEPGT VIRFRGDDAS GAVLVVQGRM AAAGTKESPI VFTSSFAVPA AGDWQGVMLL GSEKRNVLEN CRIEAAQTGL EAIFSNLTLK NVRAERSKAG MRFQDALVVM EGGGTSDCDT GLNFSESEAT LRNLNLIGNR KGLVAQRSSI YLQEGSFSMN GSAFSCDSCR VRLQGGGVSD NGRGITLYES EGSVTGVEVA RNSDYGISLA TSRIRITGNQ ITGNGNSGLL VFDASSVAWD NAIHDNGYDL YNAGKEEFRA PGNWWGAAGP KIYDNGGAGK VLSTPRLTAP PEAGSKDKP
|
| |