Gene Cwoe_0841 details

Gene Information

Locus tag: Cwoe_0841
Symbol:
ID: 8731274
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 881150
End bp: 883015
Gene Length: 1866 bp
Protein Length: 621 aa
Translation table: 11
GC content: 75%
IMG OID: 646501457
Product: NHL repeat containing protein
Protein accession: YP_003392649
Protein GI: 284042309
COG category:
COG ID:
TIGRFAM ID:
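
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the protein length excludes the stop codon. Neither convention is stated in the record, but both agree with the listed values. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 881150
end_bp = 883015

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is the stop and is not translated.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length)     # 1866
print(remainder)       # 0
print(protein_length)  # 621
```

A remainder of 0 confirms that the gene length is a whole number of codons, as expected for an unspliced CDS.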


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.81979
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCAGGC TCGCACGCTT CCTCCCGCCG CTCGCGGCGG CCGCGGTCGC CGCGGCCGCG 
ATCGTGCCCG GTGCCGCCGG CGCGGCGACC GCGAGAGTGA CGGTCCAGGT CGAGACGGCA
GCCGGCCCCG TCACGGGAGC GTCGGTCCGC CTCTTCGCCG CCGGCGCGCG CAGCGCGACG
GTCGCCGGGC GTGCGAGAAC CGGCAGCGAC GGCAGTGCCG CGGTCACCTA CCGCACGCCC
GCCGCGGGCA GCCCGCTCTA CGCCGTCGCC GACGGCGGCA GCGTCGGCGG CAGAGCGCTG
CCCGCGGGCG TCCGGCTCCT GTCGATCGCC GGCCGCGGAG GCTCGACGCC GACGACCGTG
TACGTCGAGG AGCGCAGCAC CGTCGCGGGC GCCTACGCGT TCGCCCGCTT CATCGACGGC
GCGCAGATCA GCGGCCCGTC GCCGGGGATG CCGAACGCGG CCGCCAGCGC CGCGAACCTC
TACGAGTCGG GCGCGGGCAA GGTCAGCTTC GTCGTCTCCA ACCCGCCGAA CGGGCTCGCG
ACCGAGGTGC TGCGGACGTT CAACACGCTC GCCACCGCGC TCGCCGCGTG CACGACCGGC
AGCTCCGCCG ACTGCCGCGC GCTGTTCGAC GCCGCACGCC CGCGCGGCGG CGCGCGCCCC
GCCGACACGC TCGCGGCCGT CGTCGCAATC GCGCGCAATC CGTCCCAGCA CGCGCGGGCG
ATCTTCGGCG TCGCCAGACG CGCCGGCGAC GACGGCTACA GGCCGGCGCT CAGCGCGCCG
CCCGGCTCGT GGATCCTTGC ACTCGTCTTC ACCGGCAGCG GGATGAACGC GCCCGGGCGG
ATGGCGTTCG ACCGCGACAG CAACGCGTGG ATCGGCAACA ACTTCCAGAT CCCGGGGACG
ACCGCCGGCC GCGAGCTGTC CGTGCTCGAC CCAGGCGGCC AGCCGACGCT CGGCAGCCCG
CGCACGGGCG GCGGTGTGCG CGGCCCCGGC TGGGGCACGG CGATCGACGG GCGCGGGCGC
ATCTGGCTCG CCAACTTCGG CGGCGACAGC GTCAGCGTCT TCACCCCGGA CGGGAGAGCG
CTGTCGCCGC GCGGCGGCTT CACGCAGGGC GGCTACAGCA GACCGCAGGG GATCACCGCC
GACCAGCGCG GCAACATCTG GGTCGCGAAC TTCGGCAACG ACAGCGTCAC GCTGCTGCCG
CGCGGCAACC CGCGCGCCGC GCGCAACATC AGAGGCGGCG GCATCTCGAA GCCGTTCGGC
ATCCAGGTCG ACGCGCGCGG CCACGTCTGG GTCACCAACG GCGCGGAGGA CCCCAGACGC
GGATCGGTCA CCGAGCTGCT GCCCGACGGC AGACCGACCG CCCGTTCGCC GATCACCGGC
GGCGGCCTCG CCTCGCCGCA GGGGCTCGCC GTCGACAGCA GCGGCAACAA GTGGGTGGCG
AACCTCGTCA GCCGCTCGGT GACGCGGATC TTGCCGGACG GCCGCGTCTC CGCCGACTCG
CCGCTCGGCC TGGGCAGCGT CGAGGGCGGC TGGGGCATCG CCGTCGACGG CGCCGACCAC
GTCTGGGTGA CCGGCTTCCT CGGCGCCAAC GTGACCGAGC TGTGCGGTGT CCGCGCGAGC
GCATGCCCGC CGGGCGCGCG CAGCGTGGGC GCGAAGATCT CACCCGCGCG CACGGGCTAC
AAGAGCGCGA GCATGGAGCA CATGACGGCG GTGCAGATCG ACGCCACCGG CAACGTCTGG
CTCGCCAACA ACTGGACGCT CGGCTCGTCC TTCGCAGAGT TCGTCGGAGG CAACGGGCTC
GTCCAGCTCG TCGGCGCCGC GACGCCGGTG CGGACGCCGC TGATCGGCCC GCCGACGCGG
CCCTAG
 
Protein sequence
MSRLARFLPP LAAAAVAAAA IVPGAAGAAT ARVTVQVETA AGPVTGASVR LFAAGARSAT 
VAGRARTGSD GSAAVTYRTP AAGSPLYAVA DGGSVGGRAL PAGVRLLSIA GRGGSTPTTV
YVEERSTVAG AYAFARFIDG AQISGPSPGM PNAAASAANL YESGAGKVSF VVSNPPNGLA
TEVLRTFNTL ATALAACTTG SSADCRALFD AARPRGGARP ADTLAAVVAI ARNPSQHARA
IFGVARRAGD DGYRPALSAP PGSWILALVF TGSGMNAPGR MAFDRDSNAW IGNNFQIPGT
TAGRELSVLD PGGQPTLGSP RTGGGVRGPG WGTAIDGRGR IWLANFGGDS VSVFTPDGRA
LSPRGGFTQG GYSRPQGITA DQRGNIWVAN FGNDSVTLLP RGNPRAARNI RGGGISKPFG
IQVDARGHVW VTNGAEDPRR GSVTELLPDG RPTARSPITG GGLASPQGLA VDSSGNKWVA
NLVSRSVTRI LPDGRVSADS PLGLGSVEGG WGIAVDGADH VWVTGFLGAN VTELCGVRAS
ACPPGARSVG AKISPARTGY KSASMEHMTA VQIDATGNVW LANNWTLGSS FAEFVGGNGL
VQLVGAATPV RTPLIGPPTR P
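
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the tool that produced this record. It builds the codon table from the conventional TCAG ordering, which gives the amino acid assignments shared by translation table 11 and the standard code. The helper names `clean`, `translate` and `gc_fraction` are hypothetical, and alternative start codons, which table 11 allows, are not handled. Only the first and last blocks of the gene sequence are used here; pasting the full sequence in works the same way.

```python
from itertools import product

# Codon table in the conventional TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGTCCAGGC TCGCACGCTT CCTCCCGCCG"
print(translate(first_blocks))  # MSRLARFLPP

# Last six bases of the gene sequence: one codon followed by the stop codon.
print(translate("CCCTAG"))  # P
```

The two printed fragments match the start (MSRLARFLPP) and the final residue (P) of the protein sequence above. Calling `gc_fraction` on the full gene sequence gives a value to compare with the GC content field.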