Gene Cwoe_0641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_0641 
Symbol 
ID8731069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp677739 
End bp679769 
Gene Length2031 bp 
Protein Length676 aa 
Translation table11 
GC content71% 
IMG OID646501254 
Producthypothetical protein 
Protein accessionYP_003392451 
Protein GI284042111 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGTGC TGAGAGCTGT CGTGCTGCTG GTCGCGTTCC TGATAGTGCT GATGCTGCCC 
AGTCCGGCCA CCGCCGACGT CTACCAGTAC GCAAACGGCT GCTACGCGCT GCGCGACGTC
CAGAGCGGCC GCTTCGTCGT CCGCGACAGC ATCGGCTACG CCGCGAGCGC GACGACCGCC
GCCGGTGCGA CCCCGTTCCG TCTGCAGGCG ACCGCGCTCG GCAGCTACCT GCTCTACGGC
CCGGACGGCC GCATGCCCGC CTCCGGCCCG CTCGGCACGG TCATCCCGAC GACCGGGCCG
GGTCCCGTGG CTGACTGGCG CGCGGTCGAG GAGGGCGGAG CGCTGCGGCT GACCAACGTC
GCGAGCGGCC GCCGGCTCGG CGTCGGCGTG CTGCGCCGCG TGGCACAGAG CGACGCGACC
GCACCGCGCT GGTCGTTCAC CCCCGCCGAC GGCTGCGCGA CGTTTCCCGA GGCCGAGCTG
AACGTCTCCG GCACCCCGTT CACCGGGGCG AGCCCGACCG CGCGCGTGCG CGGGTTCCTC
GACACCCACG TTCACCTCAA CGCGGAGCGG TTTCTCGGCG GGCGCTTCCA CTGCGGCAAG
CCGTGGAGTC CGTACGGCGT CACGGTTGCG CTGCGCGACT GCTCCGACCA CTTCCCGAAT
GGCGCGGGCG CCCTGTTCGA GAACGTGCTG TCGACGGGCA GCCCGATCGG CACGCACGAC
ACCGACGGCT GGCCGAGCTT CGCCGGTTGG CCGCGCGACG AGTCGCTGAC CCACGAGGGG
ACGTATTGGA AGTGGATCGA GCGCGCCTGG CGCTCGGGCC TGCGCCTGAT GGTCAACGAC
CTGGTCGAGA ACCGCGCGCT GTGCGAGCTG TACCCGCTCA AGCAGAACGA CTGCGACGAG
ATGGCGAGCG CGTTCGCGCA GGCTGAGGAG ATGGTCGCCC TCCAGGACTA CGTCGACGCT
CAGTTCGGCG GCCCCGGCAG AGGCTTCTTG CGGATCGTCG AGAGCCCGGC CGAGGCGCGC
CGCGTCATCA ATGCCGGGAA GCTTGCGGTC GTGCTCGGGG TCGAGATCTC CGAGGTGCTC
GGCTGCGGTC AGTTCAACGG CGTCCCGCGC TGCAGCACGG CGCAGATCGA GGCCGAGCTC
GACCGGCTCC ACGACATCGG CGTGCGCTCG CTCTTCCCCG TGCACAAGTT CGACAACGCG
CTCGCCGGAA CGAAGTTCGA CGCCGGGACG ACAGGTGTGC TCGTCAACGT CGGCAACAAG
TACGCGACCG GCCGCTTCTG GGCCGCCGCG AACTGCGCGG GCGCCGCCGA CACCGACAAC
GAGCCGACCA ACCTGGCCGG CGATCAGGCC GCGCTGATCT ACACGCTGCT CGGTCCGCTC
GTCGCGCAGC CGCTGCTCGA GGGCCAGCTC CCGATCTATC CGCCGGGGCC GCTGTGCAAC
CCGAAAGGCC TCACGCCGCT CGGCGAGGCC GCGATCCGGG CGATGATCGA CCGCGGCATG
CTGATCGAGA CCGATCACAT GAGCGCGAAG GCGCGCCGGG AGACGCTCGC GCTGCTCGAG
GCCGAGGGCT ATGGGGGCGT GATCTCGAGT CACAGCTGGG GTGACGGCGG CAGCCGCCGC
CGCATCCAGC AGCTCGGCGG CATGGTCGGC CCGATCACGA GCGGAGCGTC GTCGTTCGTG
GAGGAATGGC GTGCAGCGCG CGCTGACCGC GACCCGGGTG AGCTGTTCGG CATCGGCTAC
GGCTCTGACA CGAACGGTCT CCACGCGCAG CCCGGGCCGC GCACGGGCGG AGCGGTCCAC
TACCCGTTCC GGTCGTTCGA CGGCGGGACG CTGATCGACC GCCAGCGCTC GGGTACCCGC
GTGTACGACG TCAACGCCGA CGGCGTCGAC CACTACGGCC TGTATCCAGA TTGGATCGAG
GACCTGCGCC TCGTCGGCGG CGACCAGATC GTCGAGGACA TGGCCGACGG CGCCGAGGCG
TACCTGCGGA TGTGGGAGCG GGCGGAGGCT GCTGCGCGGC CGGTGCGCTG A
 
Protein sequence
MKVLRAVVLL VAFLIVLMLP SPATADVYQY ANGCYALRDV QSGRFVVRDS IGYAASATTA 
AGATPFRLQA TALGSYLLYG PDGRMPASGP LGTVIPTTGP GPVADWRAVE EGGALRLTNV
ASGRRLGVGV LRRVAQSDAT APRWSFTPAD GCATFPEAEL NVSGTPFTGA SPTARVRGFL
DTHVHLNAER FLGGRFHCGK PWSPYGVTVA LRDCSDHFPN GAGALFENVL STGSPIGTHD
TDGWPSFAGW PRDESLTHEG TYWKWIERAW RSGLRLMVND LVENRALCEL YPLKQNDCDE
MASAFAQAEE MVALQDYVDA QFGGPGRGFL RIVESPAEAR RVINAGKLAV VLGVEISEVL
GCGQFNGVPR CSTAQIEAEL DRLHDIGVRS LFPVHKFDNA LAGTKFDAGT TGVLVNVGNK
YATGRFWAAA NCAGAADTDN EPTNLAGDQA ALIYTLLGPL VAQPLLEGQL PIYPPGPLCN
PKGLTPLGEA AIRAMIDRGM LIETDHMSAK ARRETLALLE AEGYGGVISS HSWGDGGSRR
RIQQLGGMVG PITSGASSFV EEWRAARADR DPGELFGIGY GSDTNGLHAQ PGPRTGGAVH
YPFRSFDGGT LIDRQRSGTR VYDVNADGVD HYGLYPDWIE DLRLVGGDQI VEDMADGAEA
YLRMWERAEA AARPVR