Gene Gmet_2020 details

Gene Information

Locus tag: Gmet_2020
Symbol:
ID: 3740677
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter metallireducens GS-15
Kingdom: Bacteria
Replicon accession: NC_007517
Strand:
Start bp: 2260126
End bp: 2261697
Gene Length: 1572 bp
Protein Length: 523 aa
Translation table: 11
GC content: 62%
IMG OID: 637779314
Product: AMP-dependent synthetase and ligase
Protein accession: YP_384974
Protein GI: 78223227
COG category: [I] Lipid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
            [TIGR03098] acyl-CoA ligase (AMP-forming), exosortase system type 1 associated
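
The coordinate and length fields in this record can be cross-checked against each other. The short Python sketch below does that arithmetic. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2260126
end_bp = 2261697

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last one is a stop codon.
codon_count, leftover = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length)     # 1572
print(leftover)        # 0
print(protein_length)  # 523
```

Under those assumptions the computed values agree with the reported gene length (1572 bp) and protein length (523 aa).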


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value: 0.256098
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCTGA GACCGTATCT GCTAGGCCAT CTGCTGGAAG ACACCGCCGC GCGGTTGCCC 
GATAAGGTTG CTGTCAAGCA CCACGACCGG ACCATCACCT ATGCCCAACT GCACGAGGAA
GCGCTCAAGA TGAAGGGGCT TATCCGCGGG CTCGGCATCA AGCGGGGCGA GCGGGTCGGC
ATCTACCTCG ACAAGTCCAT CGAGCAGCTC ACCGCCATGT TCGGCGCCAC CCTGGCCGGG
GCGGTCTTTG TCTTCATCAA CCCCATCCTC AAGAAAGAGC AGATCGAATA CATCGTCAAC
GATTGCCAGA TCCAGCTCAT GATCACCACC AGCGAGCTGT TCCGCAAGAA CCACCTGGCC
GCGCCCGGAA AGCTCATCCA TGTGGACGAG CCGGAGCACG ACCGGGAAGG GCACCCCTGC
TGGCCCAAGC TCAAGGCAAC GCTCCCGGCT GATTACACCC CCGTTCCCGG CTTCTCCCCG
GACATCGCCT GCCTCATCTA CACCTCCGGT TCCACCGGCA TGCCCAAGGG GGTGGTGGTG
CCCCACAGCA CCGTGGTGGA CGGGGCCGAG ATCGTCAGCA CCTACCTGGA GATCACCGAG
AAGGACCGGA TTATCAGCGT CCTTCCCTTC AACTTCGACT ACGGCCTCAA CCAGGCCACC
ACCGCCGTCC TCCACGGCGC GACCCTGGTG CTGCACCAGT TCGTCATGGT CAAGGATCTC
CTGGACCTCC TCGTCAAGGA AGAGATCACC GGCTTTGCCG GCATGCCCCC TATCTGGGCC
AAGCTCTTCA ACGACAAGAT CAAGCTCACC TACAACTCCG ACTTCCCCCA CCTGCGCTAC
CTCACCAACA GCGGCGGCAA GGTGCCCCGG ATCATGGTCT CCCGCATCCG CGAGTTTTTC
TCCAACTCCC GGCTCTTCCT CATGTACGGG CTCACCGAAG CATTCCGCTC CACCTTCCTT
CCCCCCGAGG AGCTTGACCG CCGGCCCGAC TCCATTGGCA AGGGGATTCC CAACGTGGAG
ATCCTGGTGG TGAACGCCAA GGGCGAAGAG TGTGCCCCGG GCGAAGAGGG GGAGCTGGTC
CACCGTGGCG CCCTCATCAC CCACGGCTAC TGGAACGACC CGGAAAAGAC CAAGGTGATC
TTCCGGAAGA ATCCCCGTTT CCACGATCAG CCCCACCTGC ACGAGACCGT GGTCTACTCC
GGCGACATCG TGAAGAAGGA CGAGGACGGC TTCCTCTACT ACGTCTCCCG CCGCGACGAG
ATGATCAAGA CCTCCGGCTA CCGGGTGAGC CCCACCGAGG TTGAAGAGGT GCTCATCGGC
TTGCCTGGCG TGAGCAACGT GGTTGTCTTC GGCAAGGAAG TGGAGTCGGG CGACCAGATC
ATCGTGGCGG TCATGGAGAC TGACCACGAG GAGGAACACA AGAAGGAGCT GCTCAAGGAG
TGCCGCAAGC GGCTCCCCAC CTACATGGTC CCCCAGGAGA TTCACTTCGA GAAGGCGTTC
AGGAAAACCG CCAACGGCAA GATTGACCGC TCGGGGATCA AGAAAGAGTG GCTGGCGGCA
GGGAAGAATT GA
 
Protein sequence
MSLRPYLLGH LLEDTAARLP DKVAVKHHDR TITYAQLHEE ALKMKGLIRG LGIKRGERVG 
IYLDKSIEQL TAMFGATLAG AVFVFINPIL KKEQIEYIVN DCQIQLMITT SELFRKNHLA
APGKLIHVDE PEHDREGHPC WPKLKATLPA DYTPVPGFSP DIACLIYTSG STGMPKGVVV
PHSTVVDGAE IVSTYLEITE KDRIISVLPF NFDYGLNQAT TAVLHGATLV LHQFVMVKDL
LDLLVKEEIT GFAGMPPIWA KLFNDKIKLT YNSDFPHLRY LTNSGGKVPR IMVSRIREFF
SNSRLFLMYG LTEAFRSTFL PPEELDRRPD SIGKGIPNVE ILVVNAKGEE CAPGEEGELV
HRGALITHGY WNDPEKTKVI FRKNPRFHDQ PHLHETVVYS GDIVKKDEDG FLYYVSRRDE
MIKTSGYRVS PTEVEEVLIG LPGVSNVVVF GKEVESGDQI IVAVMETDHE EEHKKELLKE
CRKRLPTYMV PQEIHFEKAF RKTANGKIDR SGIKKEWLAA GKN
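
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be stripped before they can be compared. The Python sketch below shows one way to do that and to translate a nucleotide fragment codon by codon. It is a minimal illustration, not the database's own method: the helper names `clean` and `translate` are hypothetical, and the codon table is the standard set of codon assignments, which translation table 11 shares for internal codons. Alternative start codons permitted by table 11 are not handled, which does not matter here because this gene starts with ATG.

```python
# Build the 64-entry codon table in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAGTCTGA GACCGTATCT GCTAGGCCAT CTGCTGGAAG ACACCGCCGC GCGGTTGCCC"
last_line = "GGGAAGAATT GA"

print(translate(first_line))  # MSLRPYLLGHLLEDTAARLP
print(translate(last_line))   # GKN
```

The two fragments translate to the first twenty and the last three residues of the protein sequence listed above. The last fragment can be translated on its own because the preceding 26 lines hold 1560 bases, a multiple of three, so it starts in frame.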