Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_1624 |
Symbol | |
ID | 8732064 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 1710812 |
End bp | 1712263 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646502242 |
Product | Mammalian cell entry related domain protein |
Protein accession | YP_003393427 |
Protein GI | 284043087 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component |
TIGRFAM ID | [TIGR00996] virulence factor Mce family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.471797 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCGCC GCAGCACATC CGTGGCGGCG AACCCGGTGC TGATCGGCGC GGCGACGGTG CTCGTCGTGA TCGTCGCGGT CTTCCTCGCA TACAACGCGA ACAACGGCCT GCCGTTCGTG CCGACTTACC AGATCTACGC GCAGGTGCCG GACTCGGCGA ACCTCGTGAC CGGCAACGAA GTCCGGATCG GCGGCGACCG CGTCGGCATC ATCTCGGCGA TCGACCCGGT CGTTCATGAC AACGGCAGAG TCACGGCGAG ACTGACGCTG AAGCTCGACA CGAACGTCAA GCCGCTGCCG ACGGACTCGA CGTTCATCGT GCGCCCGCGT TCGGCGGTCG GCCTCAAGTA CCTCGAGGTC ACGCGCGGCA GATCGAGAGA AGGGCTCGAC GAAGGTGCCA CCACCTCGCT CGCGCAGGCT ACGCCGAGAC CCGTCGAGAT CGACGAGTTC TTCAACATGT TCGACGAGAA GACGCGCAAG GCGAACCAGG CGAACCTGAA GATCTTCGGG GACGCGCTCG CCGGTCGCGG CATCGACCTC AACGAGGCGA TCGTCGAGCT CGACCCGCTG ACGAGAAACC TGATCCCCGT CATGCGGAAC CTGAATGACC CGCGCACGGG CTTCGGCGAG TTCTTCGGCG CGCTCCAGCG CACGGCGTCG ATCGTCGCGC CGGTCGCCGA GCAGCAGGGG CAGCTGTTCC GGAACCTGTC GACGACGTTC GACGCATTCG CGGCGATCTC GCGCCCGTAC CTGCAGGAGT CGATCAGCGG CGGGCCGCCG GCGATGGAGG CGGCGATCTC GGCGTTCCCG TTCCAGCGCA AGTTCCTCGC CAACTCCGCC GGCTTCTTCC GCGAGCTGCA GCCGGGCGCG CAGGCGCTGC GCACCTCGGC GCCGCTGCTC GCAGAGGCGT TCACGGTCGG CACGAGAACG ATCACGCGCG CCTCGGCGCT GAACGAGCGG CTCGCGCGCC AGATGAGATC GCTGCAGGCG TTCGCCGAGG ACCCGCAGGT GCCGCTCGGC ATCAAGGGCC TGAACAACAC GGTCGACGTG CTCTCGCCGA CGATCGCGAA CCTGTCCGCG ATCCAGACGC AGTGCAATTA CATCGGGCTG TTCCTGAACA ACCAGGCGAG CGTGCTGTCG GACTACGACA ACAGCACGCC CTCGCAGGGT TCGTGGGCAC GCCTGCTCGC GATAGGTGGC CCGATCGGCC CGAACAGCGA AGGCGGTCCT GCCTCAGCGC CCGCCGACGG CAGACCCACG TACGCAGACG TCCCGGTCAA CAACCTGCAC ACGAACGTCT ATCCGAAGAC CGGAGCACCG GGCCAGAACG GCGTCTGCAT GGCCGGCAAC GAGGAGTACG AGGTGGGCAG AACGGTCATC GGAAACCCGC CTGGTTCGCC GATGAGAACG GCGGATACAC CAAGACTCCT GTTCGACGAT TGGCAGCCGT GA
|
Protein sequence | MNRRSTSVAA NPVLIGAATV LVVIVAVFLA YNANNGLPFV PTYQIYAQVP DSANLVTGNE VRIGGDRVGI ISAIDPVVHD NGRVTARLTL KLDTNVKPLP TDSTFIVRPR SAVGLKYLEV TRGRSREGLD EGATTSLAQA TPRPVEIDEF FNMFDEKTRK ANQANLKIFG DALAGRGIDL NEAIVELDPL TRNLIPVMRN LNDPRTGFGE FFGALQRTAS IVAPVAEQQG QLFRNLSTTF DAFAAISRPY LQESISGGPP AMEAAISAFP FQRKFLANSA GFFRELQPGA QALRTSAPLL AEAFTVGTRT ITRASALNER LARQMRSLQA FAEDPQVPLG IKGLNNTVDV LSPTIANLSA IQTQCNYIGL FLNNQASVLS DYDNSTPSQG SWARLLAIGG PIGPNSEGGP ASAPADGRPT YADVPVNNLH TNVYPKTGAP GQNGVCMAGN EEYEVGRTVI GNPPGSPMRT ADTPRLLFDD WQP
|
| |