Gene Cwoe_1624 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_1624 
Symbol 
ID8732064 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp1710812 
End bp1712263 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content67% 
IMG OID646502242 
ProductMammalian cell entry related domain protein 
Protein accessionYP_003393427 
Protein GI284043087 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component 
TIGRFAM ID[TIGR00996] virulence factor Mce family protein 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.471797 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCGCC GCAGCACATC CGTGGCGGCG AACCCGGTGC TGATCGGCGC GGCGACGGTG 
CTCGTCGTGA TCGTCGCGGT CTTCCTCGCA TACAACGCGA ACAACGGCCT GCCGTTCGTG
CCGACTTACC AGATCTACGC GCAGGTGCCG GACTCGGCGA ACCTCGTGAC CGGCAACGAA
GTCCGGATCG GCGGCGACCG CGTCGGCATC ATCTCGGCGA TCGACCCGGT CGTTCATGAC
AACGGCAGAG TCACGGCGAG ACTGACGCTG AAGCTCGACA CGAACGTCAA GCCGCTGCCG
ACGGACTCGA CGTTCATCGT GCGCCCGCGT TCGGCGGTCG GCCTCAAGTA CCTCGAGGTC
ACGCGCGGCA GATCGAGAGA AGGGCTCGAC GAAGGTGCCA CCACCTCGCT CGCGCAGGCT
ACGCCGAGAC CCGTCGAGAT CGACGAGTTC TTCAACATGT TCGACGAGAA GACGCGCAAG
GCGAACCAGG CGAACCTGAA GATCTTCGGG GACGCGCTCG CCGGTCGCGG CATCGACCTC
AACGAGGCGA TCGTCGAGCT CGACCCGCTG ACGAGAAACC TGATCCCCGT CATGCGGAAC
CTGAATGACC CGCGCACGGG CTTCGGCGAG TTCTTCGGCG CGCTCCAGCG CACGGCGTCG
ATCGTCGCGC CGGTCGCCGA GCAGCAGGGG CAGCTGTTCC GGAACCTGTC GACGACGTTC
GACGCATTCG CGGCGATCTC GCGCCCGTAC CTGCAGGAGT CGATCAGCGG CGGGCCGCCG
GCGATGGAGG CGGCGATCTC GGCGTTCCCG TTCCAGCGCA AGTTCCTCGC CAACTCCGCC
GGCTTCTTCC GCGAGCTGCA GCCGGGCGCG CAGGCGCTGC GCACCTCGGC GCCGCTGCTC
GCAGAGGCGT TCACGGTCGG CACGAGAACG ATCACGCGCG CCTCGGCGCT GAACGAGCGG
CTCGCGCGCC AGATGAGATC GCTGCAGGCG TTCGCCGAGG ACCCGCAGGT GCCGCTCGGC
ATCAAGGGCC TGAACAACAC GGTCGACGTG CTCTCGCCGA CGATCGCGAA CCTGTCCGCG
ATCCAGACGC AGTGCAATTA CATCGGGCTG TTCCTGAACA ACCAGGCGAG CGTGCTGTCG
GACTACGACA ACAGCACGCC CTCGCAGGGT TCGTGGGCAC GCCTGCTCGC GATAGGTGGC
CCGATCGGCC CGAACAGCGA AGGCGGTCCT GCCTCAGCGC CCGCCGACGG CAGACCCACG
TACGCAGACG TCCCGGTCAA CAACCTGCAC ACGAACGTCT ATCCGAAGAC CGGAGCACCG
GGCCAGAACG GCGTCTGCAT GGCCGGCAAC GAGGAGTACG AGGTGGGCAG AACGGTCATC
GGAAACCCGC CTGGTTCGCC GATGAGAACG GCGGATACAC CAAGACTCCT GTTCGACGAT
TGGCAGCCGT GA
 
Protein sequence
MNRRSTSVAA NPVLIGAATV LVVIVAVFLA YNANNGLPFV PTYQIYAQVP DSANLVTGNE 
VRIGGDRVGI ISAIDPVVHD NGRVTARLTL KLDTNVKPLP TDSTFIVRPR SAVGLKYLEV
TRGRSREGLD EGATTSLAQA TPRPVEIDEF FNMFDEKTRK ANQANLKIFG DALAGRGIDL
NEAIVELDPL TRNLIPVMRN LNDPRTGFGE FFGALQRTAS IVAPVAEQQG QLFRNLSTTF
DAFAAISRPY LQESISGGPP AMEAAISAFP FQRKFLANSA GFFRELQPGA QALRTSAPLL
AEAFTVGTRT ITRASALNER LARQMRSLQA FAEDPQVPLG IKGLNNTVDV LSPTIANLSA
IQTQCNYIGL FLNNQASVLS DYDNSTPSQG SWARLLAIGG PIGPNSEGGP ASAPADGRPT
YADVPVNNLH TNVYPKTGAP GQNGVCMAGN EEYEVGRTVI GNPPGSPMRT ADTPRLLFDD
WQP