Gene A2cp1_4429 details


Gene Information

Locus tag: A2cp1_4429
Symbol:
ID: 7300485
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter dehalogenans 2CP-1
Kingdom: Bacteria
Replicon accession: NC_011891
Strand:
Start bp: 4929718
End bp: 4931961
Gene Length: 2244 bp
Protein Length: 747 aa
Translation table: 11
GC content: 75%
IMG OID: 643597235
Product: capsular exopolysaccharide family
Protein accession: YP_002494812
Protein GI: 220919508
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID: [TIGR01005] exopolysaccharide transport protein family
            [TIGR01007] capsular exopolysaccharide family
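
The start, end, gene length and protein length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon; the function names are hypothetical and are not part of the database record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length in aa, assuming one terminal stop codon is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4929718, 4931961)
print(length)                     # 2244
print(protein_length_aa(length))  # 747
```

Both results agree with the Gene Length and Protein Length fields, which is consistent with the assumed coordinate convention.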


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCCCGA AGCCCGGAGG CCCGGCCCTG GAGATGGTCC CCGACAAGCG CCCCGCGCCG 
CCGCCCGAGC CGGCCTACGC GGGCGACGGC GGCGACGGCG ACGAGGTCAG CCTCGCCGAG
TACCTCGACG TGCTGGTGCA GGGGCGCTGG CTCATCGCCG GCGCGGCCGC GGTCGCGCTC
GTGTGCGGCG TCGCCTACGC GCTCCTCGCC ACGCCGGTGT ACCGCTCCGA CGCGCTCGTG
CAGGTGGAGG ACAAGAAGGG CGGCACCGGA GGGCTGGGCG ACCTCTCGGC GCTCTTCAGC
GAGGCCTCGC CGGCCGAGAC CGAGATGGAG ATCCTCCGCT CGCGCTCGCT GGTCGGCTCG
GTGGTGGACG CGCTGAAGCT CGAGATCTCG GCCGAGCCGC GCCGCTTCCC GGTGGTGGGC
CGCTTCCTGG CGCGCCGGCA CAGGGGCGAC GCGCCGGCCT CGGCGCTGCC CGGGTTCGGG
CGCTTCGGCT GGGGCGGCGA GAAGCTCACG CTGGGCCGGC TGGCGGTGCC GGAGGAGCTG
GAGGGCGAGC CGCTCACGCT GGAGGCCCGC GAGGGCGGGC GCTTCGCGCT GCTCGATCCG
GACGGCGAGC TGCTGGTCGA GGGCGCGGCG GGCGCGGCCG CCTCGGGGCG GCGCACCGAG
CTGTTCGTGG CGGAGCTGGT GGCGCGCCCG GGCACGCAGT TCCGGGTCTC GCACCGGCCG
CGCGACGCGG CCATCGCCGA CCTGCAGGAG GACCTGCGCA TCTCGGAGAA GGGCAAGAAG
ACCGGCGTCA TCCAGCTCGC GCTGGAGGAC GAGGACCCGG CCCGCGCCGC GGCCATCCTC
GACGCGCTCT CGAGCGCCTA CCTGCGCCAG AACGTGGAGC GCAAGAGCGC CGAGGCCGAG
AAGACGCTCG AGTTCCTGGA GACGCAGCTC CCCGAGCTGC GCGGCAAGCT CGACGTGGCC
GAGCGCGACC TCGAGTCCTA CCGCTCGGCG AAGGGCAGCG TCGACGTCTC CATGGAGACG
CAGGCCGCGC TCACCCGCGC GGTGGACATC GAGAAGGCCG CCTCCGAGCT GCAGCTCGAG
ATCGCCGCCC TGCGCCAGCG CTTCACCGAG GACCACCCGC TGCTCATCGC CGCCCGCCAG
AAGATGGGCC GCCTCGACGA GGAGCGCCGG AACCTCGAGG CGCGCATGCG CAAGCTGCCG
GAGGCGGAGC TGGAGTCGGC CCGCCGCCTG CGCGACGTGA AGGTCGCGAA CGAGCTGTAC
CTCACGCTCC TCAACAAGGC GCAGGAGCTG AAGGTCGTGA AGGAGGGCAC GGTCGGCAAC
GTGCGCATCC TCGACGCCGC GCTCGTGCCC CTGAAGCCGG TCGCGCCGAA GCGCGGCGCG
GTGGTGGCGC TGGCGCTGCT CCTCGGCCTG GCCGGCGGGG TGGCGCTCGC GTTCGTCCGC
AAGGCGCTCG ACCAGGGCGT CGAGGATCCG GACGCGCTGG AGCGGGCCAC CGGGGTGGGC
GTCCACGCCT CGGTGCCGCA CAGCGATGCG GAGGGGATCG CCACCCGCGC GGCGGGGCAC
GCCGGCAAGC ACCCGGTGCT CGCCCGCACC GATCCGAACG ATCTCGCGGT CGAGGCGCTC
CGCAGCCTCC GCACCAGCGT GCAGTTCGCG CTGCTCGAGG CGAGCTCCAA CGTCGTCACG
GTGGGCGGCC CGGCGCCGGG CATCGGCAAG TCGTTCGTCA CCGCGAACCT CGCGGTCCTG
CTGGCCGAGG CCGGCAAGCG CGTGGTGGTG GTGGACGCCG ACCTCCGCCG CGGCCACCTG
CACCGCTTCC TGGGCGGCGA GCGCGCGCCG GGGCTCACCG ACGTCCTGAG CGGCGCCCAC
ACGCTCGCGG GCGCCCTCCG CACGACCGAG CACGAGAACA TCCGGCTCCT CACCACCGGC
ACCATCCCGC CCAACCCGGC GGAGCTGCTC GGCTCGGAGC GCTTCCAGCG GCTGCTCGCC
GAGCTGTCGG CGACGTGGGA CCTCGTGGTG GTGGACACCC CGCCCATCCT CGCGGTGGCC
GACGGCGCGC TCATCGCGCG CCAGGCGGGC GTGAACCTGT TCGTGGTGAA GGCGGGCAAG
CACCCGATCC GCGAGATCCA GGCCGGCCTG CGCGCGCTCA CCCGCGCCGG CGCGCGCGTC
CACGGCATCG TGATGAACGA CGTGCGCCTC GACCGCGGCC TGGGCCGGCG CAGCGCGTAT
CACTACCAGT ATTCGTACAA GTAG
 
Protein sequence
MTPKPGGPAL EMVPDKRPAP PPEPAYAGDG GDGDEVSLAE YLDVLVQGRW LIAGAAAVAL 
VCGVAYALLA TPVYRSDALV QVEDKKGGTG GLGDLSALFS EASPAETEME ILRSRSLVGS
VVDALKLEIS AEPRRFPVVG RFLARRHRGD APASALPGFG RFGWGGEKLT LGRLAVPEEL
EGEPLTLEAR EGGRFALLDP DGELLVEGAA GAAASGRRTE LFVAELVARP GTQFRVSHRP
RDAAIADLQE DLRISEKGKK TGVIQLALED EDPARAAAIL DALSSAYLRQ NVERKSAEAE
KTLEFLETQL PELRGKLDVA ERDLESYRSA KGSVDVSMET QAALTRAVDI EKAASELQLE
IAALRQRFTE DHPLLIAARQ KMGRLDEERR NLEARMRKLP EAELESARRL RDVKVANELY
LTLLNKAQEL KVVKEGTVGN VRILDAALVP LKPVAPKRGA VVALALLLGL AGGVALAFVR
KALDQGVEDP DALERATGVG VHASVPHSDA EGIATRAAGH AGKHPVLART DPNDLAVEAL
RSLRTSVQFA LLEASSNVVT VGGPAPGIGK SFVTANLAVL LAEAGKRVVV VDADLRRGHL
HRFLGGERAP GLTDVLSGAH TLAGALRTTE HENIRLLTTG TIPPNPAELL GSERFQRLLA
ELSATWDLVV VDTPPILAVA DGALIARQAG VNLFVVKAGK HPIREIQAGL RALTRAGARV
HGIVMNDVRL DRGLGRRSAY HYQYSYK
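
The gene and protein sequences above are related through translation table 11, and the GC content field is derived from the gene sequence. The following is a rough sketch of both calculations, assuming the sequence is pasted as plain text with the space-separated blocks of ten shown above. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the sketch does not handle alternative start codons, which is harmless here because the gene starts with ATG.

```python
from itertools import product

# Amino acids of NCBI translation table 11, listed for codons ordered
# with bases T, C, A, G (first base varies slowest, third base fastest).
TABLE_11_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), TABLE_11_AAS)
}


def clean(seq: str) -> str:
    """Remove whitespace (block separators, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(s), 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence above.
first_row = "ATGACCCCGA AGCCCGGAGG CCCGGCCCTG GAGATGGTCC CCGACAAGCG CCCCGCGCCG"
last_row = "CACTACCAGT ATTCGTACAA GTAG"
print(translate(first_row))  # MTPKPGGPALEMVPDKRPAP
print(translate(last_row))   # HYQYSYK
```

The two translated fragments match the start and end of the protein sequence above. Passing the full gene sequence to `gc_content` returns a fraction that can be compared with the rounded GC content field.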