Gene Cwoe_5035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_5035 
Symbol 
ID8735501 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp5368651 
End bp5370018 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content77% 
IMG OID646505662 
ProductNHL repeat containing protein 
Protein accessionYP_003396821 
Protein GI284046481 
COG category[V] Defense mechanisms 
COG ID[COG4257] Streptogramin lyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0114728 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00343119 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCGCACCG CACGCCTGCT TCCCCCGTTC CTGCTCGCCG GAGCGCTGCT CGGCCCGCTC 
GCGGGCGTCG CCGCGGCCGC GCCCGCCGTC GACGGCGAGT TCGCGCTCGG CGCCCAGCCG
CGCCATCTCG CGCTCGGCCC CGACGGGAAC ATGTGGGTCG CGCTCGACGG CGTCGCCGAC
GACGTCGCGA AGGTCGCGCC CGACGGCAGC GTGACGAAGT ACACGGCCGC GGCGATCACG
AACCCGGTCG GGATCGCGGC CGGCCCCGAC GGCAACCTGT GGGTCACCCA GAACGGCGCC
GTCGTGCGCT TCTCGCCCGC CGACCCGACG ACCGCGAGAG CGTTCCCCGT CCCGCAGATC
GTCGATCCGC GCGCGATCGT CAGCGGGCCG GACGGCAACC TCTGGACCGC CTCCGCCGAC
AGAGCGATCC AGGTCACGAC CGCCGGCGTC GCGAGAGACT TCACCGTCCC AGGGATGGGC
GCGCGCGGGA TCGCGCGCGG CGGCGACGGC GACCTCTACA TCGCCGACTT CGGCGGCCAG
CGGGTCGTCG GGCTGACGAC CGCGGGCGTG CCGACGTTCT ACAGAACCGG CGGCGGGCCG
CAGGAGGTCG CGGCGGGCCC CGACGGCCGC ATCGCCTACA CCAACCCGAC GAACGTGCCG
CAGCAGGTCG GCCGCTTCGT GCGCGGCGGC AGCGTCGCCA CGACCGACGT TCCCGGGACC
GACCCGTTCG GCATCACGCT CGGCGACGAC GGCGCCTGGT GGACGGCCGA CTTCGCCAGA
TCGACGATCA GCCGCCTGAC GACCGACGGC GCCGTGACGC CGCTCGCCGG CCTCAGCGCC
GGGTCCGGCC CGCGCTTCGT CGCGACCGGC GCTGGCGGCA CGCTGTGGGT CGCGCTGGAG
ACGTCGCAGA GAGTCGCGCG CGTGACGGGT GTGACCGCGC CGCCGCCGCC GCCCCCGCCC
GCACCGCCGG CCCCGCCCGC GCCTCCGCGG CCCGTCGACC CGCGTCCGGC CGACCGCCTC
GCTCCGCGCA TCGCGCTCAC GCTGCCGAGA CGGATCGTCG CCGGGCGCGC GCTGACGGCG
CGGATCGCGC TCTCCGAGCC GTCCGCGCTG ACGATCCGCT TCCAGCGTGT CCTGCCCGGC
CGCCGCGCCG GCCGCGCGTG CGTGCGACCG ACGCCGCGGC TGCGCAGAGC GCGCCGCTGC
ACCCGCGCCG TCACCGTCGC GAAGGCGACG CGGCGTGCCG GCGGTCTGCG CGTGAGGGTG
GTCATCGCCG GCCGGCGCGT GAGAGCGGGA CCGAGCCGCC TCGTCGTGAC CGCGACCGAC
GCGAACGGCA ACCGCACGAC GCACGTGGTG CGGCTGACCG TGCGCTGA
 
Protein sequence
MRTARLLPPF LLAGALLGPL AGVAAAAPAV DGEFALGAQP RHLALGPDGN MWVALDGVAD 
DVAKVAPDGS VTKYTAAAIT NPVGIAAGPD GNLWVTQNGA VVRFSPADPT TARAFPVPQI
VDPRAIVSGP DGNLWTASAD RAIQVTTAGV ARDFTVPGMG ARGIARGGDG DLYIADFGGQ
RVVGLTTAGV PTFYRTGGGP QEVAAGPDGR IAYTNPTNVP QQVGRFVRGG SVATTDVPGT
DPFGITLGDD GAWWTADFAR STISRLTTDG AVTPLAGLSA GSGPRFVATG AGGTLWVALE
TSQRVARVTG VTAPPPPPPP APPAPPAPPR PVDPRPADRL APRIALTLPR RIVAGRALTA
RIALSEPSAL TIRFQRVLPG RRAGRACVRP TPRLRRARRC TRAVTVAKAT RRAGGLRVRV
VIAGRRVRAG PSRLVVTATD ANGNRTTHVV RLTVR