Gene Anae109_0033 details


Gene Information

Locus tag: Anae109_0033
Symbol:
ID: 5376569
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 42625
End bp: 43872
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 72%
IMG OID: 640841547
Product: aromatic hydrocarbon degradation membrane protein
Protein accession: YP_001377237
Protein GI: 153002912
COG category: [I] Lipid transport and metabolism
COG ID: [COG2067] Long-chain fatty acid transport protein
TIGRFAM ID:
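
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Both assumptions agree with the values listed here but are not stated by the record itself. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(42625, 43872)
print(length, protein_length(length))  # prints: 1248 415
```

Under those assumptions the computed values match the Gene Length (1248 bp) and Protein Length (415 aa) fields.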


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.744264
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 60
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCACGA CGCCGATCCT CGCCACGCTC CTCCTCGCCC CCGCCCTCGC CTCGGCGAGC 
GGGTACGAGG TCATCTCCGT CAACCCGCGC GACCTCGCGC TCTCCCACTC CGCGGTGGCC
GTGCAGGTCG ACGCCGCCGC GGCGTCCCTC AACCCGGCCG CGCTCTCGAA GCTCGAGGGC
CCCACGGTCT CCGTCGGCGG CTCCCTCCTG AACATCTGGA CGGAGTGGGA CGGCGACCCG
GCCCGCGGGC CGGCCGGCCA CGCCCAGACC CGCTTCGAGC CGGTGACCCC CGTCGCGATC
TACGCGGGCT GGGGCACGAA GCTCGCCGAC CGCGGCTTCG GCGTCGGCGT CGGCTTCACG
CAGCCCTTCG GCGGCAACGT GTTCTGGGAG GACGACTGGG AGGGGCGCGG CCGCATCGTC
GAGGTGCAGC GCCGCTTCTT CGGCACCTAC GCCACCGCGG GCTACGAGGT CCTCCCGCAG
CTCCGCCTCG GCGGCGGGCT CGTCTGGTAC TACGGGTTCG AGTACCTGAA GCAGGGCATC
CAGCCCATCC CGGCGGCGTA CGGCGAGCTC GACACGAAGG GCGGCGGCGT CACCTACCAG
GTGTCCGCCG AGATCCAGCC GGTGCCCTCC TACCCGCTCG TCTTCGGCGT CGACTACAAG
CACAAGGCGC ACGTCACGCT CGAGGGCGAC GGCAACTTCG TGGTGCCGCC GTCCCTGGAG
AGCGCGGACA CGCGGGACCA GGGCGTGTCC CACGACGTGA CGCTGCCGAA CCTGCTCAAC
GTGGGCGTAG GGTGGCGCCC GGCGAAGCCC GTCCTCCTGA CGCTCCAGTA CTCCTGGTCG
CGCTGGGTGG AGTACGTGGA CGACACCTTC GAGGGCGACG CGGGCCTCAC GCTCACGGTG
CCCCGCGACT ACCGCAACGG CCAGGTCGTC CGCGGCGGCG TGGAGTGGCA GGCGCTGCCC
GCGCTCGCGC TGCGCCTCGG GCTCATGCGC GACACCTCCG GGCTGCGCGA CACGACCTAC
TCGCCGACGC TCCCGGACTC GAACACCACC GGCGTGTCGA CCGGGCTCAC CTGGGCGTTC
GGCAAGCGCG GGCTCGCCGT GAACGCCGCC TTCTTCTACG GCTACCGCGA CGAGGTGGAG
ACGGAGGGGG ACATCGCGTT CCCCGGCACG TTCCAGACCG ACATCATGAT CACCTCGCTG
AGCCTCAGCT GGAACACGGA CCTGGCCCGC GCCGCCCGGG CGCGCTAG
 
Protein sequence
MRTTPILATL LLAPALASAS GYEVISVNPR DLALSHSAVA VQVDAAAASL NPAALSKLEG 
PTVSVGGSLL NIWTEWDGDP ARGPAGHAQT RFEPVTPVAI YAGWGTKLAD RGFGVGVGFT
QPFGGNVFWE DDWEGRGRIV EVQRRFFGTY ATAGYEVLPQ LRLGGGLVWY YGFEYLKQGI
QPIPAAYGEL DTKGGGVTYQ VSAEIQPVPS YPLVFGVDYK HKAHVTLEGD GNFVVPPSLE
SADTRDQGVS HDVTLPNLLN VGVGWRPAKP VLLTLQYSWS RWVEYVDDTF EGDAGLTLTV
PRDYRNGQVV RGGVEWQALP ALALRLGLMR DTSGLRDTTY SPTLPDSNTT GVSTGLTWAF
GKRGLAVNAA FFYGYRDEVE TEGDIAFPGT FQTDIMITSL SLSWNTDLAR AARAR
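
The sequences above are laid out in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own method, of how the blocks might be joined and a GC percentage computed. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example uses only the first line of the gene sequence.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCGCACGA CGCCGATCCT CGCCACGCTC CTCCTCGCCC CCGCCCTCGC CTCGGCGAGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Running the same functions over the full gene sequence would give a value to compare against the GC content field.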