Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0465 |
Symbol | |
ID | 8135774 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 571710 |
End bp | 572693 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644868083 |
Product | NMT1/THI5 like domain protein |
Protein accession | YP_003020303 |
Protein GI | 253699114 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 0.0409028 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGAGAG CCGCAATTGT CCTGATTTTG TCATTGCTGA TCCTCGGTTG CCGGTCGAAG ACCCCTGCCG AGCCAGTTAA GCCGCTTGCA CTGCGGCTTG CCTACACCAC GAAGCACCAT TCCGCGCTGG TGCACCTGGC TGCAGCCAAG GGGTATTTTC AGGCGGAGGG CGTGCTGTTG CAGCCGATGC TGTTCGAATT CGGGAAGCAG GCCTTGGCCG CTGTCTCCGA GGGAAAGGCC GATCTGGCGA CCGTGTCCGA GACTCCTTTC GTCCTTGCCG CGCTGAACGG CCAGCGGCTC TCCCTGGTCG CCAGTATCTT CACCTCGTGG AAGAACAACG GCATCGTCGC CAGGGGGCTC GCGAGCCCGC GCGACCTGCG CGGGAAGCGG ATAGCCTATA CCCCGGGCAC GACGTCGGAA GTGTTCCTGG ACTCTTTCCT GATGTCGCTG CGGATCGCAA GGCAGGAGGT GACCCTGGTG GCTTTGGCCC CGCAGCAGAT CCCGGCTGCG CTCGCCGCAG GCGAGGTGGA TGCGGCCTCC ACATGGAATC CCACTCTGAA GGAGGCTGCA ACAGGACTGA GGGGCGCCGG GAGCGTGTTT TTCGACCCCT ACCTCTACAC CGAGACCTTC GTCCTGGCGG GAAACCGCGG CTACGTCGAC GCCAATCGGG AGCTCATGCA GCGCGTGCTG CGCGCCCTGC TTAAGGCGGA GGCTTATGCT TCCAAGCACC CGGCGGAGGC CCGGACCCTG ATGGCTGACG CCATGAAGCT GACCCCTGAG CTTCTCGCCG AGTTCTTGGA CGAGAGCAGG CTCAGGGTCT CCCTGGACCG CTCCCTGCTT CTGTCGCTCG AGGAGGAGAG CCGGTGGGCT TTGAGGCGCA AGCTGGCCCC GGAGGGGGCC ATCCCCAATT ACCTGGAGTA CATCGACGTC AGACCGCTGC AGGAGGTGAA GCCTGAAGCG ATAGAGATCA ACATCAGAAG GTGA
|
Protein sequence | MRRAAIVLIL SLLILGCRSK TPAEPVKPLA LRLAYTTKHH SALVHLAAAK GYFQAEGVLL QPMLFEFGKQ ALAAVSEGKA DLATVSETPF VLAALNGQRL SLVASIFTSW KNNGIVARGL ASPRDLRGKR IAYTPGTTSE VFLDSFLMSL RIARQEVTLV ALAPQQIPAA LAAGEVDAAS TWNPTLKEAA TGLRGAGSVF FDPYLYTETF VLAGNRGYVD ANRELMQRVL RALLKAEAYA SKHPAEARTL MADAMKLTPE LLAEFLDESR LRVSLDRSLL LSLEEESRWA LRRKLAPEGA IPNYLEYIDV RPLQEVKPEA IEINIRR
|
| |