Gene AnaeK_4408 details


Gene Information

Locus tag: AnaeK_4408
Symbol:
ID: 6784423
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. K
Kingdom: Bacteria
Replicon accession: NC_011145
Strand:
Start bp: 4958700
End bp: 4960940
Gene Length: 2241 bp
Protein Length: 746 aa
Translation table: 11
GC content: 74%
IMG OID: 642765875
Product: capsular exopolysaccharide family
Protein accession: YP_002136740
Protein GI: 197124789
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID: [TIGR01005] exopolysaccharide transport protein family
            [TIGR01007] capsular exopolysaccharide family
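
The coordinates and lengths in this record are mutually consistent, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon. A minimal Python sketch of that arithmetic follows; the function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record: Start bp 4958700, End bp 4960940.
length_bp = gene_length(4958700, 4960940)
print(length_bp)                  # 2241
print(protein_length(length_bp))  # 746
```

Both results match the Gene Length (2241 bp) and Protein Length (746 aa) fields.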


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCCCGA AGCCCGGAGG CCCGGCCCTG GAGATGGTCC CCGACAAGCG CCCCGCGCCA 
CCGCCCGAGC CGGCCTACGC CGGCGACGCC GACGGCGACG AGGTCAGCCT CGCCGAGTAC
CTCGACGTGC TGGTGCAGGG GCGCTGGCTC ATCGCCGGCG CGGCGGCGGT CGCGCTCGCG
TGCGGCGTCG CCTACGCGCT CCTCGCCACG CCGGTGTACC GCTCCGACGC GCTCGTGCAG
GTGGAGGACA AGAAGGGCGG CACCGGCGCG CTGGGCGATC TCTCGGCGCT CTTCAGCGAG
GCCTCGCCGG CCGAGACCGA GATGGAGATC CTCCGCTCGC GCTCGCTGGT CGGGTCGGTG
GTGGACGCGC TGAAGCTCGA GATCTCGGCC GAGCCGCGCC GCTTCCCGGT GGTGGGACGC
TTCCTGGCGC GCCGCCACAA GGACGACGCG CCGGCCTCGG CGATGCCGGG GTTCGGGCGC
TACGGCTGGG GCGGCGAGAA GCTCACCCTG GGCCGGCTGG CGGTGCCGGA GGAGCTGGAG
GGCGAGCCGC TCACGCTGGA GGCCCGCGAG GGCGGGCGCT TCGCGCTCCT CGATCCGGAC
GGCGAGCTGC TGGTCGAGGG CGCGGTGGGC GCGGCCGCCT CGGGGCGGCG CACCGAGCTG
TTCGTGGCGG AGCTGGTGGC GCGCCCGGGC ACGCAGTTCC GGGTGTCGCA CCAGCCGCGC
GACGCGGCCA TCGCCGACCT GCAGGACGGC CTGCGCATCT CGGAGAAGGG CAAGAAGACC
GGCGTCATCC AGCTCGCGCT GGAGGACGAG GACCCGGCCC GCGCCGCGGC CATCCTCGAC
GCGCTCTCGA GCGCCTACCT GCGCCAGAAC GTGGAGCGCA AGAGCGCCGA GGCCGAGAAG
ACGCTCGAGT TCCTGGAGAC GCAGCTCCCC GAGCTGCGCG GCAAGCTCGA CGTGGCCGAG
CGCGACCTCG AGACCTACCG CTCGGCGAAG GGCAGCGTGG ACGTCTCCAT GGAGACGCAG
GCCGCGCTCA CCCGCGCGGT GGACATCGAG AAGGCCGCCT CCGAGCTGCA GCTCGAGATC
GCCGCCCTGC GCCAGCGCTT CACCGAGGAT CACCCGCTCC TCATCGCGGC CCGCCAGAAG
ATGGGCCGCC TCGACGAGGA GCGCCGGAAC CTCGAGGCGC GCATGCGCAA GCTGCCGGAG
GCGGAGCTGG AGTCGGCCCG CCGCCTGCGC GACGTGAAGG TCGCGAACGA GCTCTACCTC
ACGCTCCTCA ACAAGGCGCA GGAGCTGAAG GTCGTGAAGG AGGGCACGGT CGGCAACGTG
CGCATCCTCG ACGCCGCGCT CGTGCCCCTG AAGCCGGTCG CGCCGAAGCG CGGCGCGGTG
GTGGCGCTGG CGCTGCTCCT CGGCCTGGCC GGCGGGGTGG CGCTCGCGTT CGTCCGCAAG
GCGCTCGACC AGGGCGTCGA GGATCCGGAC GCGCTGGAGC GGGCCACCGG GGTGGGTGTC
CACGCCTCGG TGCCGCACAG CGACGCGGAG GGCATCGCCA CCCGCGCGGC GGGGCACGCC
GGCAAGCACC CGGTGCTCGC CCGCACCGAT CCGAACGATC TCGCGGTCGA GGCGCTCCGC
AGCCTCCGCA CCAGCGTGCA GTTCGCGCTG CTCGAGGCCA GCTCCAACGT CGTCACCGTG
GGCGGCCCGG CGCCGGGCAT CGGCAAGTCG TTCGTCACCG CGAACCTCGC GGTCCTGCTG
GCCGAGGCCG GCAAGCGCGT GGTGGTGGTG GACGCCGACC TCCGCCGCGG CCACCTGCAC
CGCTTCCTGG GCGGCGAGCG CGCGCCGGGG CTCACCGACG TCCTGAGCGG CGCGCAGACG
CTCGCGAGCG CCCTCCGCAC GACCGAGCAC GAGAACATCC AGCTCCTCAC CACCGGCACC
ATCCCGCCCA ACCCGGCGGA GCTGCTCGGC TCGGACCGCT TCCAGCGGCT GCTGGCCGAC
CTGTCGGCGA AGTGGGACCT CGTGGTGGTG GACACCCCGC CCATCCTCGC CGTGGCCGAC
GGCGCGCTCA TCGCGCGCCA GGCGGGCGTG AACCTGTTCG TGGTGAAGGC GGGCAAGCAC
CCGATCCGCG AGATCCAGGC CGGCCTGCGC CAGCTCACCC GCGCCGGCGC CCGCGTCCAC
GGCATCGTGA TGAACGACGT GCGCCTCGAC CGCGGCCTGG GCCGGCGCAG CGCGTATCAC
TATCAGTATT CGTACAAGTA G
 
Protein sequence
MTPKPGGPAL EMVPDKRPAP PPEPAYAGDA DGDEVSLAEY LDVLVQGRWL IAGAAAVALA 
CGVAYALLAT PVYRSDALVQ VEDKKGGTGA LGDLSALFSE ASPAETEMEI LRSRSLVGSV
VDALKLEISA EPRRFPVVGR FLARRHKDDA PASAMPGFGR YGWGGEKLTL GRLAVPEELE
GEPLTLEARE GGRFALLDPD GELLVEGAVG AAASGRRTEL FVAELVARPG TQFRVSHQPR
DAAIADLQDG LRISEKGKKT GVIQLALEDE DPARAAAILD ALSSAYLRQN VERKSAEAEK
TLEFLETQLP ELRGKLDVAE RDLETYRSAK GSVDVSMETQ AALTRAVDIE KAASELQLEI
AALRQRFTED HPLLIAARQK MGRLDEERRN LEARMRKLPE AELESARRLR DVKVANELYL
TLLNKAQELK VVKEGTVGNV RILDAALVPL KPVAPKRGAV VALALLLGLA GGVALAFVRK
ALDQGVEDPD ALERATGVGV HASVPHSDAE GIATRAAGHA GKHPVLARTD PNDLAVEALR
SLRTSVQFAL LEASSNVVTV GGPAPGIGKS FVTANLAVLL AEAGKRVVVV DADLRRGHLH
RFLGGERAPG LTDVLSGAQT LASALRTTEH ENIQLLTTGT IPPNPAELLG SDRFQRLLAD
LSAKWDLVVV DTPPILAVAD GALIARQAGV NLFVVKAGKH PIREIQAGLR QLTRAGARVH
GIVMNDVRLD RGLGRRSAYH YQYSYK
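
The gene and protein sequences can be cross-checked against each other, and the GC content recomputed, with a short script. The sketch below is a plain-Python illustration with hypothetical helper names (`clean`, `gc_fraction`, `translate`). It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code, and it assumes the gene sequence is given 5' to 3' on the coding strand, in 10-base blocks separated by whitespace as printed here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon.

    Note: an alternative start codon (e.g. GTG or TTG) would not be converted
    to M here; this gene starts with ATG, so that case does not arise.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGACCCCGA AGCCCGGAGG CCCGGCCCTG GAGATGGTCC CCGACAAGCG CCCCGCGCCA"
print(translate(first_line))  # MTPKPGGPALEMVPDKRPAP
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and protein length fields to be checked in the same way.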