Gene GM21_0555 details


Gene Information

Locus tag: GM21_0555
Symbol:
ID: 8135866
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 680089
End bp: 681693
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 65%
IMG OID: 644868168
Product: amino acid adenylation domain protein
Protein accession: YP_003020387
Protein GI: 253699198
COG category: [I] Lipid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
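
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon; neither convention is stated in the record itself, and the function names are hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


# Values copied from the record above.
gene_length = span_length(680089, 681693)
print(gene_length, protein_length(gene_length))
```

With the values from this record, the computed span is 1605 bp and the computed protein length is 534 aa, which agrees with the Gene Length and Protein Length fields.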


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.000000001061
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTACCTAC TGCAACGGTT ACTGACCCGA AGCGCCGCCG CCTTCCCCGA CAAAACCGCG 
GTCTCCTTCC GCAACCAGGA GCTCTCCTAC GCCGAGCTCC AGGCGCAAAG CAACCAGCTA
AGCGCGCTCC TCAAGGGGCA CGGCGTGAAG CGCGGAGACC GGGTCGGGAT CCTTTTGAAC
AAGTCGCTTG AGTCCATCGT CTCGGTGTTC GGCATCCTCA AGGCCGGCGC CACCTACGTC
CCCCTGGATC CGGCAGCACC TGCCGCCAGA CAGGCCTCCA TCATCAGGCA CTGCGGCATC
GAGACGCTCC TCGCCGCACC GCAACTGCTG GAGCGGCTTT TGGCTGAGGC AGGGGAGGCC
CCGCCGCTGC GCGCAGCCAT CGTCACCGGC TCCCCGGCGG CGGCACTCCC ACACCCGGCC
GGCAGCATGA GCTGCAGCGG CTGGGACGAG ATCCTGGGCG AGAGCTGCGA GGTCCCGGCG
AACGACGGGC TTTGCGGCGC CGCTCCCGCC TATATCCTGC ACACCTCAGG TTCCACCGGC
GCCCCCAAGG GTGTGGTGAT CTCCCATCTC AACGCCCTCA CCTTCGTGGA GATGGCGGTC
CGCTTCTTCG AGATCTCGCC GCGGGACCGC CTGGCCAATC ACGCGCCGCT GCATTTCGAC
CTCTCCATCT TCGACATCTT TTGCGCCGTC AGGAGCGCTG CAACCATGGT GCTGGTTCCG
GAAGCGCTCT CGGCATTCCC GGTGCGCCTG GCGGATTTCA TGCAGAGCGA GGCGATCACC
GTGTGGAACT CGGTGGCGTC GCTTCTCACC AAGCTTGCGG ACCAGGGGGC GCTGGACCGG
CTCACCCTGG AAAAGCTGCG CCTGGTCCAC TTCTCCGGGG ACCTGATGCC GGTCAAATAC
CTGAAGATCC TGAAGCGGTG CATGCCGGCT GCCGTCTTTT ACAACATCTA CGGCCAGACC
GAGGCCAACT CCTCTCTCTA TTTCAGGGTC CCGGATGTCG TGGAGGAAGC GGCCTGGAAG
ATCCCGATCG GGACCCCCTT CCCCAATTTC GAGGTGTTCG CCGTCGACGA GGGGGGGAAC
GTGGTGACCG GGGCGGGAGA GGAGGGTGAG CTGCACGTCC TCAGCTCCAC CGTGGCTCTC
GGCTACTGGA ACGACTGCGA CAGGACGAAG GCGCAGTTCA CCCCGGACCC GCGCAACCCC
GCCGCCCACG CCAGGGTGTA CAGGACCGGT GACATGGCGC GCCTGGACGC CGCCGGCAAC
TTCGTCTTCG CCGGCCGCAA GGACCACATG GTGAAGAGCA AGGGGTTCCG GGTGGAGCTG
GACGAGATCG AGATCGTGCT GAACAGCGAC CCCGGCATCC GGCAGGCGGC CGTGGTGGCC
ATCCCCGACG ACCTCGCCGG AAGCAGGATA GTCGCCTACG TATGCCTGCG CGAAGGGGTC
GAACTTAAGC CGCAAAGGCT CGTCGGGCTT TGCGCCGACC ATCTCCCGAA ATACATGGTG
CCGGAACAGA TCAGGTACCT CCCCTCCCTG CCGGTGACCT CCAGCGGCAA GATAGACCGC
AACGCCCTGG TGCAGGCGTT TCTCTACGGG CCTGCCAAGC GATAA
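
The gene sequence is printed in blocks of 10 bases, 60 per line, so the spacing has to be stripped before any calculation. As a minimal sketch, the GC content field could be recomputed from the sequence as follows. The function names are hypothetical, and the record does not say how the database computes or rounds its 65% figure.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Only the final line of the listing is used here for brevity;
# paste the full listing to check the whole gene.
last_line = "AACGCCCTGG TGCAGGCGTT TCTCTACGGG CCTGCCAAGC GATAA"
seq = clean_sequence(last_line)
print(len(seq), f"{gc_fraction(seq):.0%}")
```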
 
Protein sequence
MYLLQRLLTR SAAAFPDKTA VSFRNQELSY AELQAQSNQL SALLKGHGVK RGDRVGILLN 
KSLESIVSVF GILKAGATYV PLDPAAPAAR QASIIRHCGI ETLLAAPQLL ERLLAEAGEA
PPLRAAIVTG SPAAALPHPA GSMSCSGWDE ILGESCEVPA NDGLCGAAPA YILHTSGSTG
APKGVVISHL NALTFVEMAV RFFEISPRDR LANHAPLHFD LSIFDIFCAV RSAATMVLVP
EALSAFPVRL ADFMQSEAIT VWNSVASLLT KLADQGALDR LTLEKLRLVH FSGDLMPVKY
LKILKRCMPA AVFYNIYGQT EANSSLYFRV PDVVEEAAWK IPIGTPFPNF EVFAVDEGGN
VVTGAGEEGE LHVLSSTVAL GYWNDCDRTK AQFTPDPRNP AAHARVYRTG DMARLDAAGN
FVFAGRKDHM VKSKGFRVEL DEIEIVLNSD PGIRQAAVVA IPDDLAGSRI VAYVCLREGV
ELKPQRLVGL CADHLPKYMV PEQIRYLPSL PVTSSGKIDR NALVQAFLYG PAKR
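
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the NCBI standard ordering; translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons. Alternative start codons such as GTG or TTG are not handled here, which does not matter for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# NCBI ordering: first base varies slowest, each position cycles T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate an in-frame DNA listing; a single trailing stop is dropped."""
    seq = "".join(blocked_dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases and final line of the gene sequence listed above.
print(translate("ATGTACCTAC TGCAACGGTT ACTGACCCGA"))
print(translate("AACGCCCTGG TGCAGGCGTT TCTCTACGGG CCTGCCAAGC GATAA"))
```

The two fragments translate to MYLLQRLLTR and NALVQAFLYGPAKR, matching the start and end of the protein sequence in this record.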