Gene Cwoe_3214 details


Gene Information

Locus tag: Cwoe_3214
Symbol:
ID: 8733663
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 3420523
End bp: 3422889
Gene Length: 2367 bp
Protein Length: 788 aa
Translation table: 11
GC content: 78%
IMG OID: 646503832
Product: transglutaminase domain protein
Protein accession: YP_003395008
Protein GI: 284044668
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases
TIGRFAM ID:
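
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3420523
END_BP = 3422889
GENE_LENGTH_BP = 2367
PROTEIN_LENGTH_AA = 788


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length):
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the start, end, gene length, and protein length fields agree with one another.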


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.257235
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.234323
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGCCA CCGCGACCCC GCCGCAGGCG CCGCCGCCCG GGCGACCGGG GCACGGCGCG 
CGGCTCGCCC CGCCGCCGGA GGGGCGCAGG CGCGCCGCCG ATCCGGCCCG GCAGGCGCGA
CCCCTGCTCT CGCCGCCGGT CGCCCGGCTC GTCACGCTGA CCGCGCTCGC CGCCTACGCG
GCGCTGCGCT GGGGGACGCT GGTCGAGCCG TACGGCGGCG GGCGGATGCT CCTGCTGCTG
GCGTTCGCGC TCGCCGCCGG CACGGCGGTC GGGCAGCTCG CGCGGCTGCC GCCGCTCGCG
CAGGCGCCCG CGGCGCTCGT CGTCGTCGTG GCGTGGGCGG CCCTGAGCGC GCTGGCCGCG
GGCGTCAGCG CGTCGCTGGT GCTCGACCCC GCCCACTGGG ACGACCTCGC GGCCGGGATG
CGGCAGGGGA TCGAGCAGCT CCCGCGCGTG CTCGTGCCGT ACAGCCAGCC CGACGTCTGG
CCGCGGATCG CGATCCTGCT CGGCGGCGCG CTGCTGCTCG CGCTCGGCTG CGTGATCGCG
CTCGGGGTGG GCCGGCCGGG CGGGGCCCGG TTCGGCCTCT CGTTCCGCGA CGGCGAGCCG
TTCGGCGGCG GGATCGCGCT GCGGCTGGCG GCGGCAACGC CGTTGATCGC GGCCTCGATC
GTCCCCGCCG CGATCATGGA GCCGGACGCT CCGGCGCTCC TCGGCGTGGT CCTGTTCGCC
CTGCTCGCCG CGTTTCTGTG GCTGGAGCGG CTGCCGCGCG CGTACGCGCC GGCGGCGACC
GTGCTGCTGC TCGTCGCGGG GCTCGGCGGC GTCGTCGCGA CGCCGTGGCT CGACCGCGAG
GAGCCGTGGG TCGACGCGCA GGCGCTCGTC GAGGACCTCG ACACGCCCGA TCCCGCGCGC
TTCGACTGGT CGCAGTCGTA CGGCCCGCTC GACTGGCCGC GCGACGGGCG CGAGGTGCTG
CGCGTCCAGT CCAGACGCGA GGCGTACTGG AAGGCGCAGA ACCTCGACGG CTTCGACGGC
GTGCGCTGGA CGCGGTCCGG GCCGCCGACG CCGGTGAGAC CGGACGGCGA GGTCCCGCGC
GAGGCGCTGG TGCGGCGCGG CTGGCGCACC GACGTGCACG TCACGCTGCG GTCGTTCTCG
ACCTCCGAGG TGGTCGGCGC CGGCACGACG CTCGGGATCC AGCGCGAGCC GACGACCTTC
TCGCCCGGCA TCAGCCCCGG CACGTGGACG TCGGACGAGC CGCTCGAGCC CGGCGACGGC
TACATCGCGC AGACGATCGT CCCGCGCCCG CGACCGACCG AGCTGCAAGG CGCAGGGACC
GACTACCCCG GCTGGATCGA CCGCTACCTG TTGGTCGGCC TGCCGCAGCA GGACGAGTTC
GCGCCTGCCC GCTGGGCGCC GGGCGGCGGC GTCGCGGCGC CGACGCCGTT CGGATCCCCG
CGGACCGCGT TCACCGACGC CAACGACGCG CGGCTCGCGA GCAGCCCGTA CGGGCCGGCG
GCCGCGCTCG CGGCGCAGCT CGCCGCCGGC CAGCCGACAC CGTACGCCTA CGTGCAGGCC
GTGCTCGCCT ACCTGCGCGG GGACGAGTTC CGCTACGACG AGAACCCGCC GCGCCGCTCG
CTGCCGCTCG ACGCGTTCCT GTTCCGCGAC AGAATCGGCT ACTGCCAGCA GTTCGCGGGC
GCGGCGGCGC TGCTGCTGCG GCTCGGTGGC GTGCCGACGC GCGTCGCCGC CGGCTTCACG
AGCGGCTCGC GCGACGGCGC GCGCGACGAG TGGGTCGTGC GCGACTACGA CGCCCACGCG
TGGGTCGAGG TCTGGTTCCC GCGGATCGGC TGGGTCAGAT TCGACCCGAC CCCGTCGACC
GCGCCCGCGC TCTCCGGCCA GGCGCCGGAG ACGCCCGCGC CGAGAAGCGC GGTCGCGCCG
TCGCTGCCGA GAGCGCCGCG GAGAGACAGC GGCGTCGCGC CGAGAGCGGC GGCGCCGGCG
CAGCGCGAGG ACGGTTCGTC GCTGCCGCTC GCGCTGCTGG CGCTCGGCGG CGCGATCGTG
CTGGGCGGCG GCGGCCTGCT GCTGCGGCGG GCGCTGCGAC CGCCGGTCGG CGCCGACGCG
CTGCTCGCCG AGCTGCGGCG GGCATTGCGG CGCACCGGGC GGCCGCCGTC GCCGCGCACG
ACGCTGGCGG AGCTGGAGCA GCGCTTCCAC GACTCGCCGG ACGCGGCCGC CTACCTGCGC
GCGATCCGGC TGGCGCGCTT CGGCGGCGAG GAGCCGGAGG TCACCGCGCG CCAGCGGCGG
GCGCTGCGCA ACGAGCTGGC GCGCGGCCTC GGCCTCGGCG GCCGTCTGCG CGCCCTCTTC
GCGCTGCCGC CGCGCTCCGG CCGCTGA
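
The GC content field can in principle be recomputed from the gene sequence above. This is a minimal sketch, assuming the value is the percentage of G and C bases over the full gene sequence; how the record rounds its figure is not stated. The function name is hypothetical, and whitespace between the 10-base blocks is stripped before counting.

```python
def gc_percent(sequence):
    """Percentage of G and C bases in a DNA string; whitespace is ignored."""
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


if __name__ == "__main__":
    # First block of the gene sequence, used only as a small demonstration.
    print(gc_percent("GTGAGCGCCA"))
```

Pasting the full gene sequence into `gc_percent` gives the value to compare against the GC content field.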
 
Protein sequence
MSATATPPQA PPPGRPGHGA RLAPPPEGRR RAADPARQAR PLLSPPVARL VTLTALAAYA 
ALRWGTLVEP YGGGRMLLLL AFALAAGTAV GQLARLPPLA QAPAALVVVV AWAALSALAA
GVSASLVLDP AHWDDLAAGM RQGIEQLPRV LVPYSQPDVW PRIAILLGGA LLLALGCVIA
LGVGRPGGAR FGLSFRDGEP FGGGIALRLA AATPLIAASI VPAAIMEPDA PALLGVVLFA
LLAAFLWLER LPRAYAPAAT VLLLVAGLGG VVATPWLDRE EPWVDAQALV EDLDTPDPAR
FDWSQSYGPL DWPRDGREVL RVQSRREAYW KAQNLDGFDG VRWTRSGPPT PVRPDGEVPR
EALVRRGWRT DVHVTLRSFS TSEVVGAGTT LGIQREPTTF SPGISPGTWT SDEPLEPGDG
YIAQTIVPRP RPTELQGAGT DYPGWIDRYL LVGLPQQDEF APARWAPGGG VAAPTPFGSP
RTAFTDANDA RLASSPYGPA AALAAQLAAG QPTPYAYVQA VLAYLRGDEF RYDENPPRRS
LPLDAFLFRD RIGYCQQFAG AAALLLRLGG VPTRVAAGFT SGSRDGARDE WVVRDYDAHA
WVEVWFPRIG WVRFDPTPST APALSGQAPE TPAPRSAVAP SLPRAPRRDS GVAPRAAAPA
QREDGSSLPL ALLALGGAIV LGGGGLLLRR ALRPPVGADA LLAELRRALR RTGRPPSPRT
TLAELEQRFH DSPDAAAYLR AIRLARFGGE EPEVTARQRR ALRNELARGL GLGGRLRALF
ALPPRSGR
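
The protein sequence corresponds to the gene sequence read with translation table 11, in which GTG can act as a start codon and is then read as methionine. The sketch below builds the codon table by hand rather than using any particular library; the names `translate`, `CODON_TABLE`, and `START_CODONS` are illustrative and not taken from the record.

```python
from itertools import product

# Codon assignments in the usual T, C, A, G ordering.
# Table 11 uses the standard amino acid assignments; it differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna):
    """Translate a CDS, reading a valid first codon as M and stopping at a stop codon."""
    cleaned = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        codon = cleaned[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon is read as methionine
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence above.
    print(translate("GTGAGCGCCA CCGCGACCCC GCCGCAGGCG"))  # MSATATPPQA
```

The first ten codons of the gene sequence translate to MSATATPPQA, matching the start of the protein sequence.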