Gene Cwoe_2853 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_2853 
Symbol 
ID8733297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp3045542 
End bp3047044 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content73% 
IMG OID646503466 
Productanthranilate synthase component I 
Protein accessionYP_003394647 
Protein GI284044307 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAGCCC CGCACGCCCA GCCCGCCGCT GCCGCCGCGA ACGCGAACGC GCTGCGCGTC 
ACGCCGTCGC TCGACGAGGT GCGCGAGCTC GCGCGCGAGC ACACGCTCGT GCCGCTGCGC
CACACGTTCG TCGACGACAT CGAGACGCCC GTCTCCGCTT TCCTCAAGCT GCGCGGGGAC
GGCCCGTCGT TCCTGCTGGA GTCGGCCGAG CAGGGCCGCA TGGGACGCTG GTCGTTCATC
GGCTTCCGGC CGCGCAGCGT GCTGCGCTGG TCGCTCGGCG ACGGCGGCGA CCCGTACGCG
CTCGCCGCCG CCGAGGTCGC CCGCTCCAGA CAGGCGCAGG TCCCTGGCCT GCCGCCGTTC
TCCGGCGGCG CGGTCGGGAT CTTCGGCTAC GACCTCGTCC GCACGGTCGA GCCGCTCGGC
GAGCCGAACC CCGACCCGGT CGGCCTGCCG GACATGGCGC TGATGTTGAC CGACGTGATC
GTCGCGTTCG ACCACCAGCG CCACGAGCTG TCGATCCTCG CGAACGTCGA TGCCGCCGCC
GGCGACCTGG AGCAGGCGTA CGCCGCGGCG GTCGCGACGA TCGAGGAGGT CCGCTGGAAG
CTCTCCGGGC CGGTCCCGCG GCCGGCGCGG CCGCCCGCCG CGCGCGACCC TGAGCAGCCC
GTCGACTTCC AGAGCAACAT GCCGCGCGAG CAGTTCGAGG GCATGGTCGA GCGGATCGTC
GAGTACATCC ACGCCGGCGA CGCCTACCAG GTGGTGCCCT CCCAGCGCTG GTCGGCGGAG
GTCCCGATCG AGGCGTTCTC GATCTACCGC GGGCTGCGCG CCGTCAACCC CAGCCCGTAC
ATGTACTTCC TCGACTTCGG CGACTTCGAG ATCGCCGGCG CGAGCCCCGA GCCGCTGCTG
ACGGTGCAGT CCGGCGTCGT CCGCACGCGG CCGATCGCTG GCACGCGCCC GCGCGGCACC
GACGCCGCGG ACGACGCGCG CCTCGCCGCC GACCTGCTCG CCGACGAGAA GGAGCGCTCC
GAGCACGTGA TGCTCGTCGA CCTCGCGCGC AACGACGTCG GCCGTGTCAG CGAGTACGGC
AGCGTCAACG TCGACGGCTA CATGGAGATC GAGAACTACA GCCACGTGAT GCACATCGTC
TCGCGCGTCT CGGGCCGTCT GCGCGAGGGG ATCGGCCCGC TCGACGCGCT GCGCTCGATC
CTGCCGGCCG GAACGCTCTC GGGTGCGCCG AAGGTCCGCG CGATGCAGAT CATCGACGAG
CTGGAACCGG TCAAGCGGGG CGGCTACGGT GGGGCGATCG GCTACCTCTC GTACACCGGC
GACCTCGACA CGTGCATCCA CATCCGTACG GTCGTCGTCA AGGACGGCGT CGCCCACGTG
CAGGCGGGCG GCGGCACGGT CGCCGACGCG AAGCCCGACT ACGAGTTCCG CGAGTCCGAG
GCGAAGGCGC GCGCGGTGCG CCAGGCGATC GCGCTGGCGG TGGCGCAGCC GGAGTGGCCC
TGA
 
Protein sequence
MGAPHAQPAA AAANANALRV TPSLDEVREL AREHTLVPLR HTFVDDIETP VSAFLKLRGD 
GPSFLLESAE QGRMGRWSFI GFRPRSVLRW SLGDGGDPYA LAAAEVARSR QAQVPGLPPF
SGGAVGIFGY DLVRTVEPLG EPNPDPVGLP DMALMLTDVI VAFDHQRHEL SILANVDAAA
GDLEQAYAAA VATIEEVRWK LSGPVPRPAR PPAARDPEQP VDFQSNMPRE QFEGMVERIV
EYIHAGDAYQ VVPSQRWSAE VPIEAFSIYR GLRAVNPSPY MYFLDFGDFE IAGASPEPLL
TVQSGVVRTR PIAGTRPRGT DAADDARLAA DLLADEKERS EHVMLVDLAR NDVGRVSEYG
SVNVDGYMEI ENYSHVMHIV SRVSGRLREG IGPLDALRSI LPAGTLSGAP KVRAMQIIDE
LEPVKRGGYG GAIGYLSYTG DLDTCIHIRT VVVKDGVAHV QAGGGTVADA KPDYEFRESE
AKARAVRQAI ALAVAQPEWP