Gene Cwoe_5731 details

Gene Information

Locus tag: Cwoe_5731
Symbol:
ID: 8736207
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 6135480
End bp: 6138701
Gene Length: 3222 bp
Protein Length: 1073 aa
Translation table: 11
GC content: 78%
IMG OID: 646506358
Product: Integrin alpha beta-propellor repeat protein
Protein accession: YP_003397507
Protein GI: 284047167
COG category:
COG ID:
TIGRFAM ID:
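
The coordinate and length fields in this record are mutually consistent, which a short check makes explicit. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 6135480
END_BP = 6138701


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming one terminal stop codon that is not translated."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 3222
print(protein_length(length))  # 1073
```

Both results agree with the Gene Length and Protein Length fields reported in the record.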


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCCCC TCGCCGCCCG CTCGCGGGCG CGTCGACGCC GCGTGCTCGC ATGCGCCGCG 
CTCGGCGCCG TCGCCGCGCT CTCGACGCCT GCCGTCGCCG CAGCCGCTCC GACCGCCACG
CTGACGCCCG CCACCGCTTT CGACGCCGGC GGCCGCCAGA TCGGCGCGCC CGTCGCGCGC
ACCTACACGG TCAGAAGCAC CGGCGCGGAT CCGCTCGCGA TCGGCGCGAT CTCGCTCTCC
GGCCCCGACG CCGCGCAGTT CGCGATCGCG GCCGACACGT GCAGCGCGCG CGACGGCGCC
AATCCGCTCG CGCTCAACCA GACGTGCACC GTCCAGGTCG CGTTCGCGCC GGCGATCACC
GGCGCTCGCT CGACCACGCT GCGGATCGCG ACGAACGGCC CCGCGCTCGA GAGCGCGGCG
ATCGCCGGCC ACGGACGCGA CCTGCTGGCG TCGGAGGCGA CGCTGACGCA CGGCGCCGTG
CGCGTCGGCG CGGCCGCGCC GACGCGCACC GTCACGCTCA CCAACCGCGA CGCCGCGCCC
TACACGCTCG GTGCGGTGAC CGTCGGCGGC TCCAACGCGA GCCAGTTCTC CAAGACCGCC
GACACGTGCA GCAGCGCGAC GCTCGCCGCC GGCGGCAGCT GCGACGTGAC CGTCGCGTTC
GCGCCGACGA GCGCCGGCGC CAAGGCCGCG ACGCTCGCGC TCGCCGGCTT CGGCCCCGCG
CCCGTCGCGC TCGCCGGCAC CGGCGTCCAG CCCGCGACCG CGCTCGCACC GGCCCGGCGC
GACTTCGGCG TGCTCGCGCC CGGCGCCGCC TCCACGGCGC GCACGTTCAC GCTCACCAAC
AGCGGCAGCG GGCCGCTCGC CGTCGGCCGC GCCGCGATCG CGGGTGCTGA CGCGGGCGTC
TTCTCGATCA CTTCCGACGG CTGCTCGGAG ACGACCGTCG CGGTCGGCCG CGGCTGCGAC
GTCGCGGTCG TCTTCGCGCC CGAGCGCGGC GGCTGGCGCA GCGCGACGCT GGAGCTTCCG
ACCGACGTCG AGGCGGCGCC GCCGGTGCGC GCGCGGCTGT CCGGCCGTGG CTCCGGCGCT
GGCACCGACA GCGATCCGCT TGGCGAGCTG TTCGACCTCG CCGACCAGCC GCTCCTGCGG
CTCACCGGCG ACGGCGAGGA CGGCGCGTCG AACGTCGCCT CCGGTGCCTG CGACGTCAAC
GGCGACGGCT ACGACGACCT GCTCGCGGGT GCGTCGACCT GGAGCCGCAC GCCGGTCGAC
GCGTCGTGGG AGGGCGGCGT CTACGTGCGC TTCGGCGGTC CCGAGGTCGG CTCGGCAGAC
CTCGCCGCGC ACGGCGACGG CCGCGTGCTG CTGCTGGAGG GCGAGAAGCG CCGCAGCCAG
GCTGGCACCG GCGTCGGCTG CGCCGGCGAC GTCAACGGCG ACGGGATCGA CGACCTCGCC
GTCGGCGCGT GGGCGTACGA GTACGACGGC CGCCCGAGCG GGACCGCCGC CGCGCGCGGC
GTCGCGTACG TCGTCTTCGG CGCCGCCGAC CTCGCGCAGC AGAGCCCGTT CGACCTCGGC
CACCTCGGCG ACCGCGGCTT CCGGATCGAG GGCCGCGACC TGCCCGCGCA CGACCACCTC
GGCTACGCGG CGACGGGGAT CGGCGACGTG GACGGCGACG GCCTCGCCGA CCTCGCGCTG
CTCGCGAACA CCGCCGACTC CGCCGACGCG ACGCCCGCGC GCAGAAACAA CGGGATCCTC
TACCTCGTCC CCGGCCAGCG CGGCTCGGCG ACGGTCGACG TCGGCGCGGC CGGCACGACG
CTCGCGCAGA TCCACGGCGC CTCGCCCGGC AGCGCGGTCG AGCCGTTCGG GCTGATCGGC
ACCGTCGTGC CGCTCGGCGA CGTGAGCGGC GACGGCGTAC CCGACCTCGG GATCGGCTCC
TACACGGCGA CGGTGCTCGG CCGCAGAGCG GCCGGCGCGG TCTTCGCGAT CAGCGGCGCC
GCGCGCGGGC GCGTCGACCT CGCCGACAGC TCCTCGTGGC TGTTCGTCGT CGGTGGCGCG
TTCCAGAGCC ACCGCGTCGG CATCGGCCTC GGGGCGGCCG GCGACGTCAA CGGCGACGGG
CTCGGCGACC TCGTGATCGG CGCCGACTCG ACGGTCACGG CTGACAGCGA TGCGGCATAC
GTCGTCTACG GCGCGCGTGG GACGCAGGCC GCGCCGATCG ACACTTCCGC GCTCGGGGAG
CGCGGGTACC GGATCCTCGG CGCGGCCGGG TCGGCGACCG GCTACGGCGT CGACGGCATC
GGCGACGTCA ACCGGGACGG CTACGACGAC GTGCTCGTCG GCGCCTACTC GGCCGGCGCG
GGCGGATCGG CGTGGGTCGT CCACGGGCGG CCCGACCCGG TGACGCTGCC GGCCAACGAC
GCCGCCTCCG GGCTCCTGCC TGCGAACGCG GCCGACACGA CGCGCTACCT CGCGCTCGCG
ACGCTGACGC CGCAGGAGGG CAGCCGCCTC GACGCGCAGA CGGCCGGTGA GCGCTTCGGC
CGCGCCGTCG CCGGCGTCGG CGACCTCGAC GGCAACGGCG CGCGCGACCT TGCGATCGGC
TCCGACAGCG CCGCGCGGCG CGAGCGCGAC CGCGCCGGGG AGCTGACCGT CGCGCTGCTG
CCGGCGGCGG CACCGGCGTA CGAGCCGGGG CCGGGGGGAC CGGGGGAGCC CGGTGGGCCG
GGTGGGCCAG GGGGTCCGGG GGGGCCGGGT GGGCCGGGGG GACCGGTGGG ACCGGGGCCG
GGCCAGCCGG GTGGCGCGGG CGGCGGGGGC GGGGCGGGGA CGGGGCCGCT CACGCCGCCC
GCCACCGGCG GCAGACGGCC GCCGGCAAGA GCCGGCGTGC CGCGTGAGCG GACGGTCGCG
GCCGGTGCGG CGACGCTGCG CGTCCCGCGC GGGACGCTGG TGCCGTCGGT GCGCGGGACG
CTCGCGCTCG GCAGCGTCCG CTGCACCGCC GCCGTGGCGC GCTGCACGGT CTCCGCGACC
GTGACGGTGC GCGTCGGCGG CAGACGCTGG ACGTTGAAGC TCGCGCCGAA GACGCTGCGC
CGCGGCGTCT CCGTGCGGTT GTCGGCGACG CTGCCGCCCG CCGCACGCAG CGCGTTGAAG
CGGTCGGCCA GGGGGAGCCT CGTCGTACGC GTCGTCACGC GCGACGAAGG TGGCCGCCGG
GGCTCGGGGA CGTGGCGCGC GACGCTGCGA GGCTCACGAT GA
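
The gene sequence is printed in blocks of ten bases, so it needs to be joined before any analysis. The following is a minimal sketch of how the blocks might be cleaned and how a GC percentage such as the one in the record could be computed. The helper names are hypothetical, and this is not necessarily the method the database used for its reported figure.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above; paste the full sequence to analyse the gene.
first_line = "ATGGATCCCC TCGCCGCCCG CTCGCGGGCG CGTCGACGCC GCGTGCTCGC ATGCGCCGCG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_content(seq), 1))
```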
 
Protein sequence
MDPLAARSRA RRRRVLACAA LGAVAALSTP AVAAAAPTAT LTPATAFDAG GRQIGAPVAR 
TYTVRSTGAD PLAIGAISLS GPDAAQFAIA ADTCSARDGA NPLALNQTCT VQVAFAPAIT
GARSTTLRIA TNGPALESAA IAGHGRDLLA SEATLTHGAV RVGAAAPTRT VTLTNRDAAP
YTLGAVTVGG SNASQFSKTA DTCSSATLAA GGSCDVTVAF APTSAGAKAA TLALAGFGPA
PVALAGTGVQ PATALAPARR DFGVLAPGAA STARTFTLTN SGSGPLAVGR AAIAGADAGV
FSITSDGCSE TTVAVGRGCD VAVVFAPERG GWRSATLELP TDVEAAPPVR ARLSGRGSGA
GTDSDPLGEL FDLADQPLLR LTGDGEDGAS NVASGACDVN GDGYDDLLAG ASTWSRTPVD
ASWEGGVYVR FGGPEVGSAD LAAHGDGRVL LLEGEKRRSQ AGTGVGCAGD VNGDGIDDLA
VGAWAYEYDG RPSGTAAARG VAYVVFGAAD LAQQSPFDLG HLGDRGFRIE GRDLPAHDHL
GYAATGIGDV DGDGLADLAL LANTADSADA TPARRNNGIL YLVPGQRGSA TVDVGAAGTT
LAQIHGASPG SAVEPFGLIG TVVPLGDVSG DGVPDLGIGS YTATVLGRRA AGAVFAISGA
ARGRVDLADS SSWLFVVGGA FQSHRVGIGL GAAGDVNGDG LGDLVIGADS TVTADSDAAY
VVYGARGTQA APIDTSALGE RGYRILGAAG SATGYGVDGI GDVNRDGYDD VLVGAYSAGA
GGSAWVVHGR PDPVTLPAND AASGLLPANA ADTTRYLALA TLTPQEGSRL DAQTAGERFG
RAVAGVGDLD GNGARDLAIG SDSAARRERD RAGELTVALL PAAAPAYEPG PGGPGEPGGP
GGPGGPGGPG GPGGPVGPGP GQPGGAGGGG GAGTGPLTPP ATGGRRPPAR AGVPRERTVA
AGAATLRVPR GTLVPSVRGT LALGSVRCTA AVARCTVSAT VTVRVGGRRW TLKLAPKTLR
RGVSVRLSAT LPPAARSALK RSARGSLVVR VVTRDEGGRR GSGTWRATLR GSR
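
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds a codon table and translates the opening and closing bases of the gene as a spot check. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the start codons it permits; since this gene begins with ATG, no special start handling is included. The `translate` helper is illustrative, not a library function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(blocked.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGGATCCCC TCGCCGCCCG CTCGCGGGCG"))  # MDPLAARSRA
print(translate("GGCTCACGAT GA"))                     # GSR
```

The two outputs match the first ten and last three residues of the protein sequence listed above.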