Gene Anae109_3999 details

Gene Information

Locus tag: Anae109_3999
Symbol:
ID: 5376758
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 4676520
End bp: 4678913
Gene Length: 2394 bp
Protein Length: 797 aa
Translation table: 11
GC content: 69%
IMG OID: 640845526
Product: TIR protein
Protein accession: YP_001381161
Protein GI: 153006836
COG category:
COG ID:
TIGRFAM ID:
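
The start and end coordinates, gene length, and protein length listed above are related by simple arithmetic. The sketch below shows one way to check that relationship. It assumes the coordinates are 1-based and inclusive, that the gene is unspliced, and that the reported gene length includes the terminal stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(4676520, 4678913)
    print(length, protein_length_aa(length))
```

With the values from this record, the two functions reproduce the listed 2394 bp and 797 aa.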


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value: 0.703882
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGATG TTCAGCAGGC GTTGAAGCGC GCATGCGCGC TGGTGAGGTC GGGAGCAACC 
GTCGGGACGG GATACCTCGT CGCGTCCGAC CTCGTCGCCA CGTGCGAGCA CGTCGTGCCG
GGAGCGCGCG AGGGCGACGC GGTCACGCTC ACCTTCGGGT ACCCCCTGCC AGAGGTTGCG
CGCACGGCGC GCGTGGCGCG CACGGATCCA GCAGAGGACT GTGCCGTCCT ACGCCTGGAC
GAGCCGATGA CGGACCGCGT CCCGCTGCAA CTGTCGGCCG CCCCCCTCCC GCCTCGGAGC
GCATGGTTCA CGTTTGGATA CCCGGCGGTG ACCAAGGCGG ACGGCACGCA CTTCGCCGGC
GTCGTCGATG ACGCAAAGGG CGTCAAGTCG GGACGGTACG TGATCGTGCT CACGTCCGAG
AAGATCGCGG CCGGGATGTC GACCCCCATC CACGGCCTCT CGGGTAGCCC GGTCGTGATC
GGCCAAGCCG TAGCGGGACA CATCGCCTCG GTGCGACCCG ACCCGGACTT CCCGCAACGT
GCCGCCTTTG GCGAGGTGTT CGCCTGTCCG GCCGCGGGCG TAATCCGCCT GCTCGATGCG
ATCGGCCGGC CCGTACCGCT CGCGGCCGCC GCCGTCCCCC CGCCGCACGC TACGCCACCC
GCCTTGGGGC ATCGGGCCTA TCACGCCTTC GTCAGCTACC GGTCGACGGA CCGGGGCTTC
GCGCTGGACC TCGTTGAGCG CCTTGAGGCG CGCGGATTCA GCATCTACAT CGACCAGCGC
GAGGTGCTAC CCGGGGACGA GCTCGCGGTG TCGCTGCAGA ACGCCATCGC CGCGAGCGCG
GCGGCAATCA TCCTCGTCAG TCGCAAATGG GTCGAGTCGC CCTGGTGCCA GCAGGAGATG
ACCGTCCTCC TGCACCGAGC AGTCGATAGC CGCACCCCCG TCATCCCTGT TCGCCTGGAC
GATGTCGAGT TGCCGCCAAT GCTCGCCTCG AAGGTGTGGC TCGACTGCGC CGGAGCTTCG
ATCGCTCCGG CCGAACAACT CGAGAGGATC GTCGCTGCGA TCGGCGAAAA CCCCGTCCGA
GCGGCTGTGC CGAGGCCGCC GGTGGCCCCG AGCGGACCCA CCACGCCCTG GAGCCGAATC
GAGGCGAGCG CGCGCGATCC GGAGGAGAGC ACGCTGACCG CGGCGCAGGC GCTCATCTCG
ATCGGCGAGC CGCGAGCAGC GGTTGAGATG CTGAAAGGGG CGGGACAAGG AACCCGTGCG
CGCCAGCTAC GTGCGCTCGC CCTGTCGAAG TCCGGGTGGA ACGAGGCGGC GATCACCGAG
CTCGAACCGC TGGTCTCAAG TGGCCAGCTC GACGCCGAGA CCGGGGGCAT TCTCGGCGGC
CGCTACAAAG ACCTGTGGAT CGAACGTGGC GACGCCAAGT ACTTGCATAA GGCCTACCGT
ATCTACCGAA CCGCATACGA GCGCAGCGGC GACACGTACC CCGGAATCAA CGTCCTCGCG
ATGGGGCTGT ACCTCCGCAA ACACGCGCCT CACCTCCTCG GAGCGAAGGA CGGCGCGCAG
CTCGCGGCAG TCGCTACCGC GGTGCGCGCG AAGACTGAGC CGATCACGGA GGAGACGGAT
GACCACTGGC AGCTCGCCAC GAGGGCCGAG GTTCTGCTCC TTGGCGGTGA CCTCGACGGC
GCCCGGCGCT TCTACGCGCT CGCCGCCAGT GCCAACCCGC TAGCCACGCA AGACATCGCG
AGGATGCGAA ACCAGGCGCG CCGGAACCTC CGATACCTCG GGCTGCCGGA GGACGGGGTC
GACGCGTCGC TCCAGGTCCC GTGCGTCGCC GCGTTCACTG GACACATGAC TGACCTCCCA
GGCCGGCCGA CGCCGCGTCT TCCGGAGGCC AAGGTCGGCG CTCTGCGCGC GCGGATCCGC
GCGTTGCTCG ACCAGCATCG CATCGGTTTC GGCTTCAGCA GCGCGGCGCG CGGCTCCGAC
ATCCTGTTCG CGGAGGAGGT CCTGGCGCGC GGTGGCCGGG TCCGGCTGTT CCTGCCGTTC
GCCCCCACTC TCTTCCGGAT AACCTCGGTC GAGACCCCCG CGGACCCTCG TTGGATCGCT
CGGTTCGACG ACCTCCTCAC CCGCGCGGCG AGCACGGATC CGCGCGTCAA CGTCTCCGTG
CTCGCAAACA TGCCTCCGCC CGAGGCGGAG CACCCCCGCG CCTATGCCGC GTGCAACCTC
GCTGTGCAGA ACGCCGCGGT CGAGAAGGCG CAGCTACTCG ACTCGAAACC GAGCCTAATC
GCCGTCTGGG ACGGCAACCC GGACGGCGGC GCTGGCGGAG CCGCCGATGC GATCCGCGAT
TGGTGCGATC GAGGAGCGGG CGACGTCGAG ATCATCGACA CGGCGACGCT GTGA
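
The gene sequence above is printed in space-separated groups of ten bases, so it has to be joined before any computation. A minimal sketch of that cleanup, together with a GC-fraction calculation that could be compared against the reported GC content, might look like the following. The helper names are hypothetical, and the GC fraction here is simply the share of G and C among all bases, which is an assumption about how the database computes its figure.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block and uppercase it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence block, as printed in the record.
    first_line = "ATGGAAGATG TTCAGCAGGC GTTGAAGCGC GCATGCGCGC TGGTGAGGTC GGGAGCAACC "
    seq = clean_sequence(first_line)
    print(len(seq), round(gc_fraction(seq), 3))
```

Passing the full sequence block through `clean_sequence` should give a string whose length can be compared with the listed gene length.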
 
Protein sequence
MEDVQQALKR ACALVRSGAT VGTGYLVASD LVATCEHVVP GAREGDAVTL TFGYPLPEVA 
RTARVARTDP AEDCAVLRLD EPMTDRVPLQ LSAAPLPPRS AWFTFGYPAV TKADGTHFAG
VVDDAKGVKS GRYVIVLTSE KIAAGMSTPI HGLSGSPVVI GQAVAGHIAS VRPDPDFPQR
AAFGEVFACP AAGVIRLLDA IGRPVPLAAA AVPPPHATPP ALGHRAYHAF VSYRSTDRGF
ALDLVERLEA RGFSIYIDQR EVLPGDELAV SLQNAIAASA AAIILVSRKW VESPWCQQEM
TVLLHRAVDS RTPVIPVRLD DVELPPMLAS KVWLDCAGAS IAPAEQLERI VAAIGENPVR
AAVPRPPVAP SGPTTPWSRI EASARDPEES TLTAAQALIS IGEPRAAVEM LKGAGQGTRA
RQLRALALSK SGWNEAAITE LEPLVSSGQL DAETGGILGG RYKDLWIERG DAKYLHKAYR
IYRTAYERSG DTYPGINVLA MGLYLRKHAP HLLGAKDGAQ LAAVATAVRA KTEPITEETD
DHWQLATRAE VLLLGGDLDG ARRFYALAAS ANPLATQDIA RMRNQARRNL RYLGLPEDGV
DASLQVPCVA AFTGHMTDLP GRPTPRLPEA KVGALRARIR ALLDQHRIGF GFSSAARGSD
ILFAEEVLAR GGRVRLFLPF APTLFRITSV ETPADPRWIA RFDDLLTRAA STDPRVNVSV
LANMPPPEAE HPRAYAACNL AVQNAAVEKA QLLDSKPSLI AVWDGNPDGG AGGAADAIRD
WCDRGAGDVE IIDTATL
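
The protein sequence corresponds to the gene sequence translated with translation table 11, as listed in the gene information. The sketch below is a minimal pure-Python translation under stated assumptions: the codon-to-residue mapping used is the one shared by the standard code and table 11, the alternative start codons of table 11 are not handled (this gene begins with ATG, so they do not matter here), and translation stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon order T, C, A, G at each position, matching the usual NCBI table layout.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three groups of the gene sequence from this record.
    print(translate("ATGGAAGATG TTCAGCAGGC GTTGAAGCGC"))
```

The first thirty bases of the gene sequence translate to MEDVQQALKR, which matches the first group of the protein sequence.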