Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3956 |
Symbol | |
ID | 8139330 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4538994 |
End bp | 4540103 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644871572 |
Product | hypothetical protein |
Protein accession | YP_003023730 |
Protein GI | 253702541 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.0000000641904 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTCCTG AAATTGTTCC CTTGATCCAG CTCCAGGGAG ACCTGGGGGA AGGGGTCGGC ATCATCGATC CGCTGCGAAA CCACCGATGG GACCGTTTCG TGGAACGGCA TCCCTTCGGA TGGCTCACCC ACCTCTCCGG ATGGAAGCGG GTGCTGGACG AAACCTTTCC CCACCTCCAC GGCTATTACC TTACCATCCA GGACCCGGCT GGCGGCATCG TGGGGGCGCT CCCCATCTAC TCGGTCGATA GCTGGCTCAT GGGAAAACGG CTGGTCAGTA TCCCCTTCGC CACCTTGTGC GACCCTTTGG TTGCCGATGC CAAGGAGATG GAGCGGCTGT TTCGGGCAGC CGTCACCCTC TCGAGGAGGG TCGGCATCCC CCGTATCGAG ATCAAGACTC TCTGCGCCTC TCATTTGCTG GACAAAGAGA GCTTTATCAC CGATTGCGGC TACAAGCGCC ATTTCCTCCA TTTAGGCGAC GACCCGGACC GGATCAGGAA GACCTTCCAT AGGAGCTGCG TCAGGCAGCG CATAGCCCGC GCGGAGCAAA GCGGGCTGCG GCTGGTGAGG GGGGCATCGG AGTCCCACCT GAAGCAGTTC TACCTGCTGC ACCGACAGAC CCGCAAAAGG AAGGGGCTCC CGCCGCATCC CTACCGGCTG ATCAAGTCGC TGGCCAACGC CTTTGCCGGA AGCAACAAGG TGGAGCTTTT GCTTTGCTGC AAGGAGGACG AGGCGGTGGC GGGAGTGATT GTCTTCAAGT ACAAGGACCG GGTTTCGGTG GAGTATTCCG CCGTTAACGC CGCCTACAAC GAGTTGAGTC CGGTGCATCT GCTGTTCTGG AACACCATCA AGAATGCCTG TCTCTCCGGC TACCGCATCC TCGACTTCGG ACAAACCTCT ATCCACAACA AAAGCCTGAT GGAGTTCAAG TCACACTGGG GGACCGAGGT CTCCGACCTG CCGCACTTCA TCTACCCGAA CGACCCCGCG CTGAACCTTG CCGCCTACCA GGACACGCTG GGGAAGAAAC TGCTGCAATA CGTCTGCAAC AAGGCGCCGG ACCCCGCACT CACCTACCTC GGCGACTTCT GCTACCGGCA CCTCGGGTGA
|
Protein sequence | MAPEIVPLIQ LQGDLGEGVG IIDPLRNHRW DRFVERHPFG WLTHLSGWKR VLDETFPHLH GYYLTIQDPA GGIVGALPIY SVDSWLMGKR LVSIPFATLC DPLVADAKEM ERLFRAAVTL SRRVGIPRIE IKTLCASHLL DKESFITDCG YKRHFLHLGD DPDRIRKTFH RSCVRQRIAR AEQSGLRLVR GASESHLKQF YLLHRQTRKR KGLPPHPYRL IKSLANAFAG SNKVELLLCC KEDEAVAGVI VFKYKDRVSV EYSAVNAAYN ELSPVHLLFW NTIKNACLSG YRILDFGQTS IHNKSLMEFK SHWGTEVSDL PHFIYPNDPA LNLAAYQDTL GKKLLQYVCN KAPDPALTYL GDFCYRHLG
|
| |