Gene GM21_3708 details

Gene Information

Locus tag: GM21_3708
Symbol:
ID: 8139082
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4273775
End bp: 4274830
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 60%
IMG OID: 644871328
Product: thioesterase superfamily protein
Protein accession: YP_003023486
Protein GI: 253702297
COG category: [I] Lipid transport and metabolism
COG ID: [COG1607] Acyl-CoA hydrolase
TIGRFAM ID:
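
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length counts the stop codon while the protein length does not. Neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is
    included in the gene length but not in the protein."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4273775, 4274830)
print(length, protein_length_aa(length))  # 1056 351
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.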


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 79
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGATG AAACGACCAA AAATGAGGTG CTCCAACTGA CGCCGCACGA CACCCGATAC 
CTGTTCGTGC TCCCTTTCTC GACGGACCCG GCGCTGGCAC GGCGCTTTCT GGCACGAGAC
CGCCAGATGC CGGGCAATAT CCGGTTTGGG AAACTTCTCG AGGTGCTGGA CAAGGTGGCC
GAGAACACCG CGCTCGGGTA CGTGAACCAG TTCTATCCCG ACGCGCGGGT GGTGACCGCG
GCCATCGACA ACATCGTGGT ACGAAACCCC GCGGACACCA CGCACGACCT GGTATTTTCA
GCGCAGATAA ACCATGTGGG GAAATCGTCC ATGGAGGTGG GGATCAGGGT GGAATGTTTG
GGGACCTGTT CAAACCACCT GGCGAGCTGC TACTTCACCA TGGTCGCCCG TTCGGCGGAC
AGCAACGAGG CAAAGAGCCT CGCGCTTCCC CCTCTTGAGT ACAGGCAGCA GATAGAGCAG
AAAAGACACC ACAAGGCGGA ACAGCGCCGC CAGGCGTACC GAGAGAGCCT GGCCAAAGCC
GAGGAAATGC CTTCGCTCGA GGAGTACCTC TTCCTGAAGA AGCTGCATAA GGAGCAGGAA
GCTCCAGACT TCGACGGCAT ACGCGCGGGG CAGCTGGCAC TGGAGTCAAC GGTCCGCGCC
TACCCGGAGC AGGAGAACGT GCCAAAGACG ATCTTCGGGG GATACCTGAT GCGTAAAGCC
TACGAACTGG CCGCGCTCGC AGCCGAGATG GTGACCGACG ACCGTGTGGT TCCCTGCCAG
GTGAACCGGA TCAACTTCAA CCAGCCGGTG CTCCTCGGGG ACCAGTTGAA GTTCACGGCG
CGGGTGGTCT TCACCGGAAA AACCACCATC ACGGTCCAGT CGGACATCCA ACGCTTCGAC
CGCGACGCCC ACAACACCGC GCTTTCCAAT TCATGCCTCT TCACCTTTAG GAACGTCGGC
AGCGAGATGG AACCCAAGCC GGTACCATTC ATCTACCCGG TCACCTACGC GGAAGACGCG
AGATTCCTGA ACGCCTACCG GCAGCGGCTG GATTGA
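
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming the reported percentage is the share of G and C among all bases, rounded to a whole number. The record does not state how the value was derived, and `gc_percent` is a hypothetical helper name.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; spaces and line breaks are ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Paste the full gene sequence here; only its first line is shown.
first_line = "GTGAGCGATG AAACGACCAA AAATGAGGTG CTCCAACTGA CGCCGCACGA CACCCGATAC"
print(round(gc_percent(first_line)))
```

The grouping into blocks of ten bases is handled by the whitespace removal, so the sequence can be pasted exactly as it is laid out in the record.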
 
Protein sequence
MSDETTKNEV LQLTPHDTRY LFVLPFSTDP ALARRFLARD RQMPGNIRFG KLLEVLDKVA 
ENTALGYVNQ FYPDARVVTA AIDNIVVRNP ADTTHDLVFS AQINHVGKSS MEVGIRVECL
GTCSNHLASC YFTMVARSAD SNEAKSLALP PLEYRQQIEQ KRHHKAEQRR QAYRESLAKA
EEMPSLEEYL FLKKLHKEQE APDFDGIRAG QLALESTVRA YPEQENVPKT IFGGYLMRKA
YELAALAAEM VTDDRVVPCQ VNRINFNQPV LLGDQLKFTA RVVFTGKTTI TVQSDIQRFD
RDAHNTALSN SCLFTFRNVG SEMEPKPVPF IYPVTYAEDA RFLNAYRQRL D
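
The protein sequence can be checked against the gene sequence by translating it. The following is a minimal sketch, assuming the standard codon assignments (which translation table 11 shares) and assuming the first codon is always read as methionine, which accounts for the GTG start codon here. `translate_cds` is an illustrative helper, not the method used to produce this record.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(seq: str) -> str:
    """Translate a CDS up to the first stop codon.

    Assumption: the first codon is an initiator and is read as M,
    whatever it would normally encode.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


first_line = "GTGAGCGATG AAACGACCAA AAATGAGGTG CTCCAACTGA CGCCGCACGA CACCCGATAC"
print(translate_cds(first_line))  # MSDETTKNEVLQLTPHDTRY
```

The output for the first 60 bases matches the first 20 residues of the protein sequence above. Passing the full gene sequence should reproduce the whole protein if the assumptions hold.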