Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1687 |
Symbol | |
ID | 8137018 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 1966142 |
End bp | 1967578 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644869299 |
Product | hypothetical protein |
Protein accession | YP_003021499 |
Protein GI | 253700310 |
COG category | [R] General function prediction only |
COG ID | [COG3497] Phage tail sheath protein FI |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 102 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAACCA TCTACAAGAC ACCGGGTGTC TACATCGAAG AAATCCCGAA GTTTCCCCCT TCAGTCGCCC CGGTCGAAAC CGCGATTCCC GCGTTCATCG GTTACACGGA GAAAGCGGAC AACTTGGTCC CAGGCGACCT CACCTTGAAG CCGACGCGCA TCTCCTCGCT GGTCGAGTAC GAGAAGATGT TCGGCGGTGC GCAAAAGGAA GAGGACATAA ACGTCGACGT CCAGGAAACT CAGGTCAACA GCGTGACGGT GGATCTGAAG GCAACGGCCA CGGTCACCGA AGCGGCCCGC TCGAAGCACA TCATGTATTA CGCGATGCAG CTCTTCTTCG CCAACGGCGG AGGCCCCTGC TACATCGTCT CGGTCGGCCC ATTCAAGGCG ACCTTCGGGG GCGCGCTGGT GGAAACGGAG CTTCAGGCCG GGCTGGACAC CCTGGTGAAG AAGGACGAGC CGACACTGAT CGTCTTCCCG GAGGCCCAAA GTCTGTCGAT CGCGGACTTC AAGACCCTGC ACGACGCAGC CCTCGCCCAG TGCGCCGATC TTAAGGACCG GTTCGTGATC ATTGACGTCC ACGGCGACAG CATCTCCCTG TCGGACCCTA ACGGGAACCT TTTGACCGCC GTCTCCAACT TCCGGACCAA CGGCATCGGG ATGAACAACC TGAAGTACGG AGCGGCCTAC GCGCCCAACA TCGACACCGT TCTAGACTTC CAGTACGACG ACAGCAAGGT CGACGTCACT ATCACCACCA ACGGCACCGC TGCGGCTCCG GTGAAGCTCG ACACGCTGAA AACCGCCAAC AACCGCATCT ACGAGCAGGC CAAGGCCGGC ATCAGGGACA TGATCTGCAA GATGCCTCCC TCCTCGTCAA TGGCCGGGAT CTACGCCGCC GTAGACAACA GCCGCGGAGT CTGGAAGGCG CCGGCAAACC TGAGCGTCAA TTCGGTGATC CAGCCGAGCA TCCAGTTCTC CAGCGTGGAG CAGGACCAGA TGAACGTCGA CCCGGTCGCC GGCAAGTCGG TGAACGCCAT CCGCGCCTTC ACCGGGAAGG GCACCCTGGT GTGGGGGGCG CGAACGCTAG CCGGCAACGA CAACGAGTGG CGCTACGTCA ACGTGCGCAG GCTCTTCAAC TTCGTGGAAG AGTCGTGCAA AAAGGCGACG GAACCTTTTG TCTTCGAACC GAACGACGCC AACACCTGGG TCCGGGTGCA AGGGATGATC GAGAACTTCC TTACGGTGAT CTGGAGACAG GGGGCGCTGC AGGGGGTTAA ACCCGAGCAC GCCTTCTTCG TCGCCGTCGG ACTGGGGAAG ACCATGACGG CCATCGACAT CCTGGAGGGG CGCATGATCG TCGAGGTCGG CTTGGCGGCG GTGCGACCAG CGGAGTTCAT CATCCTCAGG TTCTCGCACA AGATGGCTGA ATCCTAA
|
Protein sequence | MATIYKTPGV YIEEIPKFPP SVAPVETAIP AFIGYTEKAD NLVPGDLTLK PTRISSLVEY EKMFGGAQKE EDINVDVQET QVNSVTVDLK ATATVTEAAR SKHIMYYAMQ LFFANGGGPC YIVSVGPFKA TFGGALVETE LQAGLDTLVK KDEPTLIVFP EAQSLSIADF KTLHDAALAQ CADLKDRFVI IDVHGDSISL SDPNGNLLTA VSNFRTNGIG MNNLKYGAAY APNIDTVLDF QYDDSKVDVT ITTNGTAAAP VKLDTLKTAN NRIYEQAKAG IRDMICKMPP SSSMAGIYAA VDNSRGVWKA PANLSVNSVI QPSIQFSSVE QDQMNVDPVA GKSVNAIRAF TGKGTLVWGA RTLAGNDNEW RYVNVRRLFN FVEESCKKAT EPFVFEPNDA NTWVRVQGMI ENFLTVIWRQ GALQGVKPEH AFFVAVGLGK TMTAIDILEG RMIVEVGLAA VRPAEFIILR FSHKMAES
|
| |