Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1629 |
Symbol | |
ID | 8136960 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1896772 |
End bp | 1898631 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644869242 |
Product | hypothetical protein |
Protein accession | YP_003021442 |
Protein GI | 253700253 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 164 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACATGA CCGGAACCCT CAGAAGGCTC GACCCCTTTT CCATAGCGGC CTTCGTCATC CCTTTGTTCA TCTACCTTTT GACCCTGGCG CCGTCCGTCA CCTTTTTCGA CAGCGGCGAG TTCATCACCG CCATCGCTTC GCTGGGGACC GCGCACTCCC CCGGATACCC CCTGTTCATC AACTACGCCA AGCCCTTTAC CTTTCTCCCC TTCGGCAGCA TCGCCTTCAG GGTCAACGTG GCCACCGCCA TCTCGGCTGC CGCCGCCTGC TACGGCGTCT ATTTCCTGAC CATGGTTTTC CTCAAGGACG AGCAAGGGCC TTGGCAGGCT CGGCTAGGTG CCCTCCCCGC GAGGCTCTGC GCCCTGAGCG CCGCGCTCAC CTTCGCCTGC ACCGCGAGGC TCTGGCTGCA GTCGAACCAC GACAAGCCGT ACCCGCTGCT GGCCTTCATC TGCGTCATCG TCTTCTGGCT CATGCTCCTT TGGCGCGACA GCTACCTGGA GGGGCGCGAG CGCCCTTCTT ACGTCTACCT CGGGGCTTTC CTGGTCGGGC TCGCCACAGG GGCGCACCAG ATCATCGTGC TGATGATACC GACCTACGCC TGGCTCCTCA TCTCGGCCGA CCGGCGCGTC GTCTTGCGGA TGAAGGAGTT CCTCATCGCC TTCGCCTTCG GCCTCTTGGG TTTCGCCATC CACCTGCACC TGGTGGTGCG GGCGCTGCAA AAGCCGTTGC TCAACTGGGG GGATTCGAAG AACCTCACCC AGTTTCTCTG GAACCTGCTC AGGAAGGGGT ACCCGGTGGA GAAGCCGCCC CGCGACTTCG GGCTTTTGTG GGCACAAGTG AACGCCTTCA ATATCCCATT TGAATTCACC ATGTTCGGAA TGTTGCTGCT GGTGGTGGGG CTGGCCGGTT TCGCCCTCAC CAGGAAGAGG TACCTGGTCC TGGGGTACCT GATCGCGCTC GCCTCGTTTC TGCTGGTCAT CGTCGGCTAC TTCAACACAC CCGGCGAGAT GATCTTCCTC ACCGAGGAGT TCTTCACCCC GCTCTACCTT TTGAGCGCCG TCTTCATCGG GGTCGGGCTC TTCATCATGG TCAGGGAGAT CGGGAACCAC CTTCCCGACC GCGCTGCGCT GCAGGGGGGG ATTTTTCTCC TTCTCTTCGC GCTTCCGGTC GGCGTCTGCG CCATGAACTA CCGCGAGAAC GACCAGCACC AGAACTACAT CGCCTTCGAC TACGCCTCCA ATACGCTCAG GTCTCTGCCG CAAAACGCCG TCATGTACAC CTGGGGGGAT TCCGGGGCTT TCCCGCTTTG GTACCTGCAG GGGGTGGAGC GGATGCGCGA GGACACAGCC CTGCCGCATA CCCCGCACCT GGTCTTCGAC TGGTACCTGG ACAGCATGCC TGTGCTTTTC CAGGGCTCGC GCCTGTACGC CTTCCCTCCG GACCAGCGTT ACCCGGAGAA CACCCTCCTC ACCTCTGTGA TGGAGCAGTA CGCGCAGCGC CCGGTCTACA TCGACTTCTC CACCAGGTAC TCGGTCGCGT TCCAGAACTT CCAGTTGCAC CAGCGCGGCA TCATCTTCCG GCTGGATCCC GCCAGCCTCC CCCCCGTTCC TTCCGACTCC GACGTCTGGG GGCTCTACTC CACACGGGGG CTCCTGGGGG AAAGCGACAT GTTCTTCCGC GACCTGGATA CCGGCAAGGC GATCCTCATC TACGGCGCGG CGCTGGTCGA ATCAGGGGAG ACCCTGCTGA AGCTGGGGCA AAAAGCTGCG GGGGTGTGGG CGCTGGAGCT GGCGGGGCGG GTCTCGCCCG AGGCAGGGCA GCAGGCGCAG CAGATTCTCA GAAGCTACGG GGTGCGCTGA
|
Protein sequence | MNMTGTLRRL DPFSIAAFVI PLFIYLLTLA PSVTFFDSGE FITAIASLGT AHSPGYPLFI NYAKPFTFLP FGSIAFRVNV ATAISAAAAC YGVYFLTMVF LKDEQGPWQA RLGALPARLC ALSAALTFAC TARLWLQSNH DKPYPLLAFI CVIVFWLMLL WRDSYLEGRE RPSYVYLGAF LVGLATGAHQ IIVLMIPTYA WLLISADRRV VLRMKEFLIA FAFGLLGFAI HLHLVVRALQ KPLLNWGDSK NLTQFLWNLL RKGYPVEKPP RDFGLLWAQV NAFNIPFEFT MFGMLLLVVG LAGFALTRKR YLVLGYLIAL ASFLLVIVGY FNTPGEMIFL TEEFFTPLYL LSAVFIGVGL FIMVREIGNH LPDRAALQGG IFLLLFALPV GVCAMNYREN DQHQNYIAFD YASNTLRSLP QNAVMYTWGD SGAFPLWYLQ GVERMREDTA LPHTPHLVFD WYLDSMPVLF QGSRLYAFPP DQRYPENTLL TSVMEQYAQR PVYIDFSTRY SVAFQNFQLH QRGIIFRLDP ASLPPVPSDS DVWGLYSTRG LLGESDMFFR DLDTGKAILI YGAALVESGE TLLKLGQKAA GVWALELAGR VSPEAGQQAQ QILRSYGVR
|
| |