Gene Cwoe_4972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_4972 
Symbol 
ID8735438 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp5300677 
End bp5301846 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content77% 
IMG OID646505599 
ProductCapsule synthesis protein, CapA 
Protein accessionYP_003396758 
Protein GI284046418 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.162067 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGCCC AACCCCCTCC CCCGCCGCCG GCTCCCGCTC CGCCACCGCT CGTGCTGCGC 
GCACCGCGGC TGGTCGCCAC CGGCGTCGCG TTCGAGCTGC GCGGGCGCGG GGCGAGCGCC
GGCGAGCGCG TGCGGGTGCA GCTGCGGCTC GACGGCCGCT GGCGCGAGCT GGCGGCGGCG
CGCGCCGGCA GGGACGGCCG CGTCACGACG CGTGTCGCGC CGCGCACGCC GCGGCCCGCG
TACCGCCTGC GGATCGCGAC GCGCGACGGG CGCCTCTCGG ACGGCGTCGT CGTGCGCACG
CGCGACGTCA CGCTCGCGGC GGTCGGCGAC ATCAACCTCG GCGACGCGAC GGCCGGGGCG
ATCGCGGCCG GCGGCGTCAA CTATCCGTGG ACGAGCGTCG CGCCGGCGCT GCGCGGCGCC
GACGTCGCGT TCGGCAACCT CGAGTGCGCC ATCTCGACGC GCGGCGCGCC GGTGCAGAAG
CAGTACACCT TCCGCGGCAG CCCGGCGGCA CTGCGGGCGA TGCGCGACTA CGCCGGCTTC
GACGTGCTCA ACCTCGCCAA CAACCACGTC GGCGACTACG GCACCGCGGC GCTGCTCGAC
ACCGTCGAGC ACGTCCGCGC CGGCGGTATG GAGGCGGTCG GCGCCGGCGG CTCGCTCGCC
TCCGCCGCGG CGCCGCGCGT CGTCGAGCGG CTGGGGCTGC GGATCGCCTT CGTCGGCTTC
TCGAACATCC TCCCGAGCGA GTTCTTCGCG ACGCCGTCAC GGGCCGGCAC GCAGCCGGCG
ACGACGGCGC AGATCCGCGC CTCCGTCGCC GCCGCCAGAC GCCGCGCCGA CGTCGTGATC
GCGACCTTCC ACTGGGGCGT CGAGCTGGAC CCGGTCGAGA ACGGCGCCGA GCAGGCGTTC
GCCGCGACCG CGCTGGCGGC CGGCGCGACC GCCGTGATCG GCGGTCACCC GCACGTGCTG
CAGCCGATCC GCATGCTCGA CGGCGGCCGC CGCCTCGTCG CGTACAGCCT CGGCAACTTC
GTCTTCGCCT CCCACCGCGC GGCGACCGTC CGTACCGGCG TCCTGCACCT CGACCTGTCG
GCCCGCGGCG TCGAGCGGAC GCGCTTCCAG CACGCCCGCA TCGACGGCGT CAAGCCGCTG
CTGACGGGGC GCTGGACGCG CGTCGGCTGA
 
Protein sequence
MVAQPPPPPP APAPPPLVLR APRLVATGVA FELRGRGASA GERVRVQLRL DGRWRELAAA 
RAGRDGRVTT RVAPRTPRPA YRLRIATRDG RLSDGVVVRT RDVTLAAVGD INLGDATAGA
IAAGGVNYPW TSVAPALRGA DVAFGNLECA ISTRGAPVQK QYTFRGSPAA LRAMRDYAGF
DVLNLANNHV GDYGTAALLD TVEHVRAGGM EAVGAGGSLA SAAAPRVVER LGLRIAFVGF
SNILPSEFFA TPSRAGTQPA TTAQIRASVA AARRRADVVI ATFHWGVELD PVENGAEQAF
AATALAAGAT AVIGGHPHVL QPIRMLDGGR RLVAYSLGNF VFASHRAATV RTGVLHLDLS
ARGVERTRFQ HARIDGVKPL LTGRWTRVG