Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2207 |
Symbol | |
ID | 8137543 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2577909 |
End bp | 2578880 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 644869822 |
Product | protein of unknown function DUF1568 |
Protein accession | YP_003022017 |
Protein GI | 253700828 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1943] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 6.45311e-16 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCTCGTC AAGCTCGCTT AGATGTTCCA GGACTGATCC ATCACGTGAT GGCGCGCGGT ATTGAAGGAC GAGAAATCTT CCGGTTGAAC AAGGAGCGGG ATGCGTTTCT TGAACGGCTG GCAGAGATGG CCAGCGAAAA GGGTGGGCCG ACGGTATATG CCTGGACTCT CATGTCCAAT CACTTCCACT TATTGATCCG TCCGGCAGAG ATGCATCTTT CTACTCTTAT GCAACGTTTG ATGACCGCCC ACGCGATAAA TTTCAATAAG AGGCATAAAA GAACCGGGCA TCTCTTTCAA AACCGGTACA AAAGCATCGT CGTCGAGGAA GATGTCTATT TCCTTGAACT CGTTCGTTAT ATTCATTTGA ATCCGATTCG TGCTGGCATC GTGACAGGTC TCAGTGCGCT GGATAAGTAT CGGTATGCAG GGCACAGCGT AATCATGGGA AAACGCGGTT ACCCTGTGCA GGAGGTAGAT GGGGTGTTGT CGTGGTTTTC GGATAACAGG AAAACGGCGC TTGCAAAGTA CCGCGAATTC GTAGACGCAG GACTTGATCA AGGAGAACGG GAAGACCTTC GTGGCGGTGG ACTAATCAGG AGTGCCGGCG GAGTGGTGGC TCTTCTATCA AGAGGTCATG ATGCACATGA GTCTGCAGAC GAGAGAATTC TTGGGAGTGG TGAATTCGTA GATTCTGTCT TAAACGCTAA GAATGTGAAA TCTGCGATTT CGCTTATCGA TGGAATCCTG AGCGAGGTTT GTTCCAGGAG TGGTATATCT GCTCAAAGAA TCCTTGGCCC AAGTCGAGAC CGTAAGGCTT GCAAAGCACG CGTGGAATTT TTTCGTCGAG CGCAGGACGA AGCGGGGACA ACAATTTCAG ATTTGGGTCG CATGACAGTA CGGTCTCATG TTGCAGTGCT AAGAGCTCTT GAACGTGCTG AACATGAGAA GGGTCTGGGA TCTGAACATT AA
|
Protein sequence | MPRQARLDVP GLIHHVMARG IEGREIFRLN KERDAFLERL AEMASEKGGP TVYAWTLMSN HFHLLIRPAE MHLSTLMQRL MTAHAINFNK RHKRTGHLFQ NRYKSIVVEE DVYFLELVRY IHLNPIRAGI VTGLSALDKY RYAGHSVIMG KRGYPVQEVD GVLSWFSDNR KTALAKYREF VDAGLDQGER EDLRGGGLIR SAGGVVALLS RGHDAHESAD ERILGSGEFV DSVLNAKNVK SAISLIDGIL SEVCSRSGIS AQRILGPSRD RKACKARVEF FRRAQDEAGT TISDLGRMTV RSHVAVLRAL ERAEHEKGLG SEH
|
| |