Gene Cwoe_3739 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_3739 
Symbol 
ID8734194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp3972386 
End bp3973474 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content71% 
IMG OID646504361 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_003395531 
Protein GI284045191 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.270575 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0735445 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCATCG AATTCACCGA GAAGATCCGC CGCATCCCCG TCTACCCGAC CGCGGGCGGC 
TACTCGCTCG GCGACGAGTT CGCGATGCTC GCGAGCAACG AGGCGCCGTT CGGGCCGATG
CCCGGCGTCG TCGAGGCCGC GACGGGCGCG ATACAGAACG CGAACCGCTA CCCGGACCCG
TCGAACCTGT CGCTGCGCAG AGCGCTCGCG GCGCGCTTCG ACTTCATGCC CGAGCGGATC
GCGATAGGCG CCGGCTCCTG CGACATCCTG CTCAGCGCCG CCGAGGCGCT GCTGGAGCCC
GGGGCGGAGG TCGTCTACTC GTGGCCGTCG TTCTCGGTCT ACCCGCAGAT GGCCGCCGCG
ACCGGCGCGC GCGCCGTCGT CGTCCCGCTC GACGAGGAGG ACCGCTACGA CCTCGACGCG
ATGCTCGCCG AGATCACCGC CGCGACGCGG CTGGTGCTGC TCTGCAACCC GAACAACCCG
ACCGGCACAG CGCTCCCGCT GGACGCGATC GAGGCATTCG TGGCGGCGGT CCCGAGACAC
GTCTGCGTGA TCGTCGACCA GGCCTACGGC GAGTTCTCGG TGCTCGACGA CCCGGACGCG
TCGGTGTCGC TCGCGCGCCG CTATCCGAAC GTCGTCCTGC TGCGCACGTT CTCGAAGGTC
TACGGCCTCG CCGGCATGCG CGTCGGCTAC GCGCTGTGCG GCGACGAGCG CTTCCGCGTC
GCCGTCGAGC AGGTGCGCCA GCCGTTCTTC TGCAACGCCG TCGGTCAGGC GGCGGCTGAA
GCCGCGCTGC TGCGGCAGGA CGTCGTGACG CAGCGCGTCG AGGAGACCGT CGCCAACCGC
CTCGAGATGG AGGAGGGGCT GCGCGAGATG GGCCTGAAGG TCGCCGAGTC GCAGGCGAAC
TTCATCTGGC ACTCGCTCGG CGACGGCGAC GAGCAGGAGA TCCTCGACGG ACTCAGAGAG
CGCAAGGTGC TGATCCGCTC CGGCGGTGCG CTCGGCCGCG CGGGCTGGGC CCGCACGACG
ATCGGCACGG CCGCCGAGAA CCGCCGCTTC CTCGCCGCGC TGCGCGAGCT GGTCCAGCAG
CCGGTCTAG
 
Protein sequence
MAIEFTEKIR RIPVYPTAGG YSLGDEFAML ASNEAPFGPM PGVVEAATGA IQNANRYPDP 
SNLSLRRALA ARFDFMPERI AIGAGSCDIL LSAAEALLEP GAEVVYSWPS FSVYPQMAAA
TGARAVVVPL DEEDRYDLDA MLAEITAATR LVLLCNPNNP TGTALPLDAI EAFVAAVPRH
VCVIVDQAYG EFSVLDDPDA SVSLARRYPN VVLLRTFSKV YGLAGMRVGY ALCGDERFRV
AVEQVRQPFF CNAVGQAAAE AALLRQDVVT QRVEETVANR LEMEEGLREM GLKVAESQAN
FIWHSLGDGD EQEILDGLRE RKVLIRSGGA LGRAGWARTT IGTAAENRRF LAALRELVQQ
PV