Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_4012 |
Symbol | |
ID | 8139386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4597573 |
End bp | 4598610 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644871628 |
Product | General secretory system II protein E domain protein |
Protein accession | YP_003023786 |
Protein GI | 253702597 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 3.66345e-31 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGTCGGCAC GGCTTGGCGA GATGTTGCTG AAGGTTGGAA CCCTGACGGA GGACCAGCTG GAGCAGGTGC TTAACGCGCA GTCCATCTAT GGCGGCAGGC TCGGGACAAA TCTCGTGGAG ATGGGGTTAG TCGAAGAGGA GGAATTGGCG CGCCTATTGA GCGAGCAGCT TGGTGTACCC TGCGCCCACC CCTCAGAACT CAGTTCCATT CCGGAATCCC TATTGAAGAT GTTCCCGCTG GAGCTGGTGC AGCGCTACCG CGTGCTCCCC CTCGCTCTGG ACGGCAAGCG GCTCACCGTG GCCATGACGA ACCCTTCGGA TTTCAAGGCG CTCGAAGACA TCGCTTTTGT TACCGGGATG ATCATCATCC CAAGGGTCTG CTCCGAACTC CGGTTGAGCA TCGCGCTGGA GCGAATCTTC GGGGTAAAGC GCCCCATGCG TTACATCCCT GTGGAGGGAG GTGCTAGGAG CCGCTTCGCC GCCACCCTCG CTGAGCGGGG GAGCGCGGAC CCCGCCTGGG ATGGCGGCGC AGTTTGCCAT ACTTCCGAGC GAGTCAGTCT GGAGGATCTG TCCGAACGCC TGGCAAAAGC CGTCGGCGAG TCCGAGGTTG TCCAGGCTGT TCTGTCCTAT CTGGCGGGTG AATTCGACCG GGGTGCCTTC CTGAGGCTGA AAGGGGGGTG CGTGCACGGG GTTCAGGCAG TGGAGGCCGG CTCACCGGTG AAAGGCTTCC CGTTTTTTGC CGCGGCGATG GCTGACACGA GGCAGTTGAA ACGGGTGGTC GAGGAAAGGC GGCTCTTTCT AGGTGAGCTG GAACCGGATC AGGGCGAAGG GCTGTTGCTG AGGGCGATGG GGGGTAAGGT TCCCGGGTCG GCGCTGCTGG TGCCTGTGGC GCTTGGCGGG CAGGTGGTGG GGGTCATCTG CGCCAGCGAT CAGAGGGGGC GACTCGGCGG TGGCGTCTTC GAGCTGCAGC GGGTCGCGGT GATGGCAGAG TTGAGCTTCG AGATGCTGTC GCTCAAGAAA AGGATCATGA CCGTGTGA
|
Protein sequence | MSARLGEMLL KVGTLTEDQL EQVLNAQSIY GGRLGTNLVE MGLVEEEELA RLLSEQLGVP CAHPSELSSI PESLLKMFPL ELVQRYRVLP LALDGKRLTV AMTNPSDFKA LEDIAFVTGM IIIPRVCSEL RLSIALERIF GVKRPMRYIP VEGGARSRFA ATLAERGSAD PAWDGGAVCH TSERVSLEDL SERLAKAVGE SEVVQAVLSY LAGEFDRGAF LRLKGGCVHG VQAVEAGSPV KGFPFFAAAM ADTRQLKRVV EERRLFLGEL EPDQGEGLLL RAMGGKVPGS ALLVPVALGG QVVGVICASD QRGRLGGGVF ELQRVAVMAE LSFEMLSLKK RIMTV
|
| |