Gene Anae109_3387 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_3387 
Symbol 
ID5375652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp3978569 
End bp3980518 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content72% 
IMG OID640844906 
Producthypothetical protein 
Protein accessionYP_001380555 
Protein GI153006230 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0810] Periplasmic protein TonB, links inner and outer membranes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGCC CGACCCGTCG TCCCCTCCTG CCGTCCAAGG GCTGCCTCGC CGTCCTCCTC 
GCGGCGCTCG CCGTCGCTCC CGCCGCCCAC GCGCAGCTCG GCATCGATCT GTCCGCGCCG
CCCAAGGAGG AGCGCAAGCC GGCGAAGAAG AAGCCGGCGG CGAAGAAGCC CCCCGTGAAG
AAGCCCGCGG CGAAGAAGCC CGCGCCGGCG AAGCCTCCCG AGCCCGAGTC GCCTCGCGAG
GCGCCGCCGC CGGTCCCGGC GCCCGAGCCG GAGAAGCCGG TCGAGGCGCT CCCGGGCGAG
CCGCCCGAGC AGCCCGCGGC CCAGCCGCAG GAGCTCCCCG GCCTGAAGCT CGCGCCCGTC
GATCCCAAGG CGACGGCGAT CGCGAAGGAG CGCCTCGCCG CGGCGAAGAA GCTCCTCGAC
GAGAAGGCCA CCGAGACCGC CGCCCTCGCG TTCGACCAGA TCCTGCGCGA GCCCACCTTC
GCCGGCGTGC ACGACGAGGC GCGCTACCAG CGCGCGAAGG CGCTCGTGCG GATGGGCCTG
CACCACTCCG CGCTCGCGGC GTTCGACGAG GTCCTCGAGA AGGGGCCGCG CGGCTCGCGC
TTCTACCACT CCGCGATGGA GTGGGTGTTC CACGTCGGAC GGAAGCTCAA GAACGAGCAG
CCGGTGCTGA ACCGGGTCGC GCGCCACGCG CAGTGGGGCT TCCCGCCGGC GTACGAGGAC
CGCTTCCACT TCCTCCTCGC GAAGTACGAG TTCGAGCGCG GCCGGGCGCT CGCCGACGCC
GGGCGCACCG CCGACGCGAA GTCCGCGTGG GCCGAGGCGC GCCGGCTCGC GTCCATGGTC
CGCCGTGAGG CGGGCGCCAA GCCGCCCGTC TCGCCCGACG CCCCCTCCGC CGGGGACGAC
GCCGGCGACG TCTACGCGAA GGCGCGCTTC GTGGACGGCC TCGTGCTGTT CGCCCAGGGC
GACGATCAGG CCTCGGTCGA GGCGTTCAAG GAGGTCGTGC GCCTCACGAA CCCGAAGCGC
GGCCGCCACC CGGATCCGGA GCTGCGCGAG CTCGCGTTCC TGCAGCTCGC GCGCATCCAC
TACCAGAACC GGCAGAACCG CTACGCCATC TGGTACTACG GGAAGATGCC CTGGGGTGGG
GAGCGCTGGC TCGAGGGGCT GTGGGAGGCC TCGTACGCCC ACTACCGGAT CGCCGACTAC
GAGAAGACGC TCGGCAACCT GCTGACGCTC CAGTCGCCGT ACTTCCAGGA CGAGTACTTC
CCCGAGTCGT ACGTCCTCGA GGCGATCGTC TACTACGAGA ACTGCCGCTA CCCCGAGGCG
CGCCGCGTGC TCGAGTCCTT CTCCCGGCTG TACGAGCCGG TGTACGAGGA GCTCGCGGGG
ATCACCACGC GTCCGCAGAC GCCCGAGGCG TACTTCGAGG TGATCGAGCA GTCGCCGCGC
CAGAAGGGCG GGGCGATCAT GCGGCGCATC CTGAAGGTCG CCTACACCGA CCAGAACATC
CGCCGCCTCG CGGAGTCGAT CCGCGAGATC GAGGACGAGA TGGATCGGGG CATCGGCGGC
CGCCGCCCCG AGTTCCGCGA GTCGGCGCTC GCGAAGGAGC TGCTCGACAA GCTCGGCGCG
GACAAGGCCA CGCTCGTCCA GGAGGCCGGC GCCCGCGCCC GCGGCAAGCT CGAGTACGAG
CGCGACTCCC TGCGCACCCT CCTGGCCCAG TCGCTGCGCA TCCGCATCGA GGTCTCCCGC
AAGGAGCGCG AGGCCCTCGA GGGCGCGCTC GCGCGGGGGA GCCAGGTCGA GGTGGTGCGC
GATCTGAAGT ACTCGACCGC CGTCTCGGAC GAGCACCTGT ACTGGCCCTA CCAGGGCGAG
TTCTGGCGCG ACGAGCTCGG CACCTACTCG TACACCCTCA CGAAGGGCTG CAAGGACCGC
CTGCCGCGGT CCCGCGCGGC GGCCCGGTAA
 
Protein sequence
MTRPTRRPLL PSKGCLAVLL AALAVAPAAH AQLGIDLSAP PKEERKPAKK KPAAKKPPVK 
KPAAKKPAPA KPPEPESPRE APPPVPAPEP EKPVEALPGE PPEQPAAQPQ ELPGLKLAPV
DPKATAIAKE RLAAAKKLLD EKATETAALA FDQILREPTF AGVHDEARYQ RAKALVRMGL
HHSALAAFDE VLEKGPRGSR FYHSAMEWVF HVGRKLKNEQ PVLNRVARHA QWGFPPAYED
RFHFLLAKYE FERGRALADA GRTADAKSAW AEARRLASMV RREAGAKPPV SPDAPSAGDD
AGDVYAKARF VDGLVLFAQG DDQASVEAFK EVVRLTNPKR GRHPDPELRE LAFLQLARIH
YQNRQNRYAI WYYGKMPWGG ERWLEGLWEA SYAHYRIADY EKTLGNLLTL QSPYFQDEYF
PESYVLEAIV YYENCRYPEA RRVLESFSRL YEPVYEELAG ITTRPQTPEA YFEVIEQSPR
QKGGAIMRRI LKVAYTDQNI RRLAESIREI EDEMDRGIGG RRPEFRESAL AKELLDKLGA
DKATLVQEAG ARARGKLEYE RDSLRTLLAQ SLRIRIEVSR KEREALEGAL ARGSQVEVVR
DLKYSTAVSD EHLYWPYQGE FWRDELGTYS YTLTKGCKDR LPRSRAAAR