Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0555 |
Symbol | |
ID | 8135866 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 680089 |
End bp | 681693 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644868168 |
Product | amino acid adenylation domain protein |
Protein accession | YP_003020387 |
Protein GI | 253699198 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.000000001061 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTACCTAC TGCAACGGTT ACTGACCCGA AGCGCCGCCG CCTTCCCCGA CAAAACCGCG GTCTCCTTCC GCAACCAGGA GCTCTCCTAC GCCGAGCTCC AGGCGCAAAG CAACCAGCTA AGCGCGCTCC TCAAGGGGCA CGGCGTGAAG CGCGGAGACC GGGTCGGGAT CCTTTTGAAC AAGTCGCTTG AGTCCATCGT CTCGGTGTTC GGCATCCTCA AGGCCGGCGC CACCTACGTC CCCCTGGATC CGGCAGCACC TGCCGCCAGA CAGGCCTCCA TCATCAGGCA CTGCGGCATC GAGACGCTCC TCGCCGCACC GCAACTGCTG GAGCGGCTTT TGGCTGAGGC AGGGGAGGCC CCGCCGCTGC GCGCAGCCAT CGTCACCGGC TCCCCGGCGG CGGCACTCCC ACACCCGGCC GGCAGCATGA GCTGCAGCGG CTGGGACGAG ATCCTGGGCG AGAGCTGCGA GGTCCCGGCG AACGACGGGC TTTGCGGCGC CGCTCCCGCC TATATCCTGC ACACCTCAGG TTCCACCGGC GCCCCCAAGG GTGTGGTGAT CTCCCATCTC AACGCCCTCA CCTTCGTGGA GATGGCGGTC CGCTTCTTCG AGATCTCGCC GCGGGACCGC CTGGCCAATC ACGCGCCGCT GCATTTCGAC CTCTCCATCT TCGACATCTT TTGCGCCGTC AGGAGCGCTG CAACCATGGT GCTGGTTCCG GAAGCGCTCT CGGCATTCCC GGTGCGCCTG GCGGATTTCA TGCAGAGCGA GGCGATCACC GTGTGGAACT CGGTGGCGTC GCTTCTCACC AAGCTTGCGG ACCAGGGGGC GCTGGACCGG CTCACCCTGG AAAAGCTGCG CCTGGTCCAC TTCTCCGGGG ACCTGATGCC GGTCAAATAC CTGAAGATCC TGAAGCGGTG CATGCCGGCT GCCGTCTTTT ACAACATCTA CGGCCAGACC GAGGCCAACT CCTCTCTCTA TTTCAGGGTC CCGGATGTCG TGGAGGAAGC GGCCTGGAAG ATCCCGATCG GGACCCCCTT CCCCAATTTC GAGGTGTTCG CCGTCGACGA GGGGGGGAAC GTGGTGACCG GGGCGGGAGA GGAGGGTGAG CTGCACGTCC TCAGCTCCAC CGTGGCTCTC GGCTACTGGA ACGACTGCGA CAGGACGAAG GCGCAGTTCA CCCCGGACCC GCGCAACCCC GCCGCCCACG CCAGGGTGTA CAGGACCGGT GACATGGCGC GCCTGGACGC CGCCGGCAAC TTCGTCTTCG CCGGCCGCAA GGACCACATG GTGAAGAGCA AGGGGTTCCG GGTGGAGCTG GACGAGATCG AGATCGTGCT GAACAGCGAC CCCGGCATCC GGCAGGCGGC CGTGGTGGCC ATCCCCGACG ACCTCGCCGG AAGCAGGATA GTCGCCTACG TATGCCTGCG CGAAGGGGTC GAACTTAAGC CGCAAAGGCT CGTCGGGCTT TGCGCCGACC ATCTCCCGAA ATACATGGTG CCGGAACAGA TCAGGTACCT CCCCTCCCTG CCGGTGACCT CCAGCGGCAA GATAGACCGC AACGCCCTGG TGCAGGCGTT TCTCTACGGG CCTGCCAAGC GATAA
|
Protein sequence | MYLLQRLLTR SAAAFPDKTA VSFRNQELSY AELQAQSNQL SALLKGHGVK RGDRVGILLN KSLESIVSVF GILKAGATYV PLDPAAPAAR QASIIRHCGI ETLLAAPQLL ERLLAEAGEA PPLRAAIVTG SPAAALPHPA GSMSCSGWDE ILGESCEVPA NDGLCGAAPA YILHTSGSTG APKGVVISHL NALTFVEMAV RFFEISPRDR LANHAPLHFD LSIFDIFCAV RSAATMVLVP EALSAFPVRL ADFMQSEAIT VWNSVASLLT KLADQGALDR LTLEKLRLVH FSGDLMPVKY LKILKRCMPA AVFYNIYGQT EANSSLYFRV PDVVEEAAWK IPIGTPFPNF EVFAVDEGGN VVTGAGEEGE LHVLSSTVAL GYWNDCDRTK AQFTPDPRNP AAHARVYRTG DMARLDAAGN FVFAGRKDHM VKSKGFRVEL DEIEIVLNSD PGIRQAAVVA IPDDLAGSRI VAYVCLREGV ELKPQRLVGL CADHLPKYMV PEQIRYLPSL PVTSSGKIDR NALVQAFLYG PAKR
|
| |