Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3159 |
Symbol | |
ID | 8138511 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3669133 |
End bp | 3670251 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644870764 |
Product | NHL repeat containing protein |
Protein accession | YP_003022944 |
Protein GI | 253701755 |
COG category | [S] Function unknown |
COG ID | [COG3391] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR02145] Fibrobacter succinogenes major paralogous domain |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 119 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGTAA GCGTCTTTCG TTGGCTATCG CGGCACAAAC CGATGAGGGG GGTGGTCGCC GCCCTGGCGC TGCCGTTGTG TCTGCTCTGC GCTTCGTGCG CGACTCTCGA GCCGACGCAG CCGATAGCGG ACCGGCAGGA ACTCCAGTGG CCCCCCTTGC CGCTCATGCC ACGCATACAA TGGGTCAAGG AGATCCGGGA TCCACATGGA GCCGGGATTG AAAAGGGGAT GTGGCGCCGC TTCACCGAAG TCTTCACCGG CGCAGTGGAA AACAGGATAG GCAAGCCGTA CGGCGTCTAC TTCGACGAAC GCGGCAGTCT CTTCGTGGTG GATGTATCCT ATGGCGTCGT CCACGTAATG GACACCAGGG AGAAAAGCTA CTACATCATC GGCGACGCCG AAAAACGCAC CTTCCGCTCG CCCATCGGCA TTACCGAGGA CGATCGGGAC AACCTCTACA TCACCGATTC CGGCGCCGCC GGCATCTTTC GCTACAACCT GACGAAAAAG AAACTGGAGC CGTTCATCAT TTCGGACCTG GCTCGCCCCA CCGGCATCGT CTTCAATAAA AGCAACCGCC TTCTCTACAT CACCGACACC ACCGAACATC AGGTGGTTGT CTTCGATCTG AAGGGCAACC TCCGTTACCG CATCGGCAGC AGGGGAAGTG CCGCTGGGCA TTTCAACTAT CCGACCGATA TAAATGTGGA CAACAGCGGG CGACTGTACG TGACCGATGC GCTCAACTCG CGCATTTCCA TCTTCTCCGC CGAGGGGACC CACCTTAACA GCTTCGGCAG GTCAGGGGAC ACGGCTGGCA ATCTGCCGAA AGCAAAAGGG GTCGCCGTCG ACAGCGCAGG CAATATCTAC ATAGTCGACG CCCTGCTGGA CGCGGTGCAG ATATTCGACC AAAGCGGCGT GCTGCTGCTG ACCTTCGGCA GCAACGGCAC CAATGCAGGG GAGTTCTGGA TGCCTTCGGG CATCTACATC GATCGCAACG ACTACATCTA CGTTTCCGAC TCGTACAACC GCAGGATCCA GGTCTTCAAG TACCTCGATG TCAAAGACGC ATCGGCGCGG GCCGTCGAAC CAGCCCGCAA CAAGCAGACG GAGCAGTAA
|
Protein sequence | MAVSVFRWLS RHKPMRGVVA ALALPLCLLC ASCATLEPTQ PIADRQELQW PPLPLMPRIQ WVKEIRDPHG AGIEKGMWRR FTEVFTGAVE NRIGKPYGVY FDERGSLFVV DVSYGVVHVM DTREKSYYII GDAEKRTFRS PIGITEDDRD NLYITDSGAA GIFRYNLTKK KLEPFIISDL ARPTGIVFNK SNRLLYITDT TEHQVVVFDL KGNLRYRIGS RGSAAGHFNY PTDINVDNSG RLYVTDALNS RISIFSAEGT HLNSFGRSGD TAGNLPKAKG VAVDSAGNIY IVDALLDAVQ IFDQSGVLLL TFGSNGTNAG EFWMPSGIYI DRNDYIYVSD SYNRRIQVFK YLDVKDASAR AVEPARNKQT EQ
|
| |