Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_2908 |
Symbol | |
ID | 8733352 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 3104980 |
End bp | 3106890 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646503521 |
Product | UspA domain protein |
Protein accession | YP_003394702 |
Protein GI | 284044362 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0531] Amino acid transporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.1088 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0371725 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAAAC GCCGCCTCCA CGGCCTCGAG CGCGTGCTCG GGGTCAACGC ACTCTTCTCG ACCGCCTACG GCAACGTCGG CTCGTCGATC TACTACGCGC TCGGGCTCGT CGCGTCGTTC GCGCTCGGCT TGACGCCGGT CGTCTTCGTG ATCGCGGGCG TGATCTTCTA CCTGACCGCC TCGACGTACG CCGAGGCGAC CGCGATGTAC CCGGAGGCGG GCGGCTCGTC GAGCTTCGCG CGCCACGCCT TCAACGAGTT CTGGTCGTTC TTCGCCGCGT GGGGCCAGAT GCTCAACTAC ACGATCACGA TCTCGATCTC GGCGTTCTTC GTGCCGCACT ACATCGGCTC GCTCTTCTGG GAGCCGTTGC GGCACGCGCC GGGCGACGTG ATCGGCGGCT GCGTCGTCGT CGCGATCCTG GCGGCGATCA ACGTCTTCGG CGCGAAGGAG ACGGCCGGTC TCAACATCAC GCTCGCGGTC GTCGACTTCG CCACGCAGCT GCTGCTGGTG ATGCTCGGCC TCGTGCTCGT CTTCTCGCCG GACACGCTGA TCGACAACGT CCAGTGGGGC ATCGCCCCGA CGTGGAGCAA CTTCATCGTC GCGATCCCCG TCGCGATGAT CGCCTACACC GGCATCGAGA CGATCTCGAA CATGGCGGAG GAGGCGAAGG ACGCGGCCAG AACGATCCCG AAGGCGATCA ACCGCGTCGT GATCGCCGTC TTCGCGATCT ACGCGCTGCT GCCGGCGATC GCGCTCAGCG CGCTGCCCGT CGCCTGCGAC AGAGCGGGGG AGTGCAAGAC GCTGCTCGGC CTGCCGGAGG ACGAGGGCGG CTTCGCCGGC GATCCCGTGC TCGGCATCGT CGAGCACATG GATCTCGGCC CGCTCCAGCA TCCCGGCGAG ATCTACGTCG GTCTGCTCGC CGCAACGATC CTCTTCCTCG CCACGAACGC CGGCATCATC GGCGTCTCGC GGCTGACCTA CTCGATGGGC GTCCACCGGC AGATGCCGGA CAAGCTGCGC CAGCTGCACC CGAGATTCCG CACGCCGTGG ATCGGCATCA TCGTCTTCTC GATCGTCGCG TGCATCGCGA TGATCCCTGG CCAGGCGGGC TTCCTCGGCA ACCTCTACGC GTTCGGCGCG ATGCTGTCGT TCACGATCGC GCACGCGGCC GTGGTGCGAT TGCGGATCAA GTACCCCGAC GCGAGACGGC CGTTCCGCGG ACCCGGCAAC GTCCGTTGGC GCGGACACGA CATCCCGCTG TTCGCCGTCT TCGGCGGCAT CTTCACCGCG CTCGCGTGGT GCGTCGTCAC GGCCCTCTAC CTCGACGTGG CGATCACGGG GCTCAGCTGG CTGGCGATCG GGGTCGTCGT CTTCGTCACG TTCCGCAAGC GCCAAGGGCT CGACCTCGTC ACGACGACGA AGGTCGCGGT CCCCAAGCCG GTGATCGACC ACGAGGCCGA GTACGAGTCG GTCCTCGTCG CCTTCGACGA GCGCGAGTAC GTCAGAGACG TGCTGTCGAC GGCGATCAAG CTCGCCGCCC GGCGCCACCG CGGGATCCAC GTGATCGTCA CGATCACGGT CCCGCCGACG AGCCCGATCC ACGCCGCGAT GCCCGAGCAG GAGTTGGCCG CGCAGTCGAT CATCGAGCAG GCGAAGGTGC AGGGCGGGCG CCGCGTGACC GGCCACTGGG AGAAGGTCCG CCCCGGTCAG GCCGGCCGCC GCATCGTCGA CGAGGCGAAG GTGATCCAGG CGCGCGCGAT CGTGATGCCG CTGCCGGTCC GCGGCGGTGG CGGCTCGGTC TTCGGCCGCA CGCTGGAGAC GGTGCTGGCA GAGCGTCCGT GCCGCGTCAT CATCGAGTCC GGTCGCGGCC GCCGCCGCGA GGCGACGCGC CAGGCGGCGG AGATCGTCTG A
|
Protein sequence | MAKRRLHGLE RVLGVNALFS TAYGNVGSSI YYALGLVASF ALGLTPVVFV IAGVIFYLTA STYAEATAMY PEAGGSSSFA RHAFNEFWSF FAAWGQMLNY TITISISAFF VPHYIGSLFW EPLRHAPGDV IGGCVVVAIL AAINVFGAKE TAGLNITLAV VDFATQLLLV MLGLVLVFSP DTLIDNVQWG IAPTWSNFIV AIPVAMIAYT GIETISNMAE EAKDAARTIP KAINRVVIAV FAIYALLPAI ALSALPVACD RAGECKTLLG LPEDEGGFAG DPVLGIVEHM DLGPLQHPGE IYVGLLAATI LFLATNAGII GVSRLTYSMG VHRQMPDKLR QLHPRFRTPW IGIIVFSIVA CIAMIPGQAG FLGNLYAFGA MLSFTIAHAA VVRLRIKYPD ARRPFRGPGN VRWRGHDIPL FAVFGGIFTA LAWCVVTALY LDVAITGLSW LAIGVVVFVT FRKRQGLDLV TTTKVAVPKP VIDHEAEYES VLVAFDEREY VRDVLSTAIK LAARRHRGIH VIVTITVPPT SPIHAAMPEQ ELAAQSIIEQ AKVQGGRRVT GHWEKVRPGQ AGRRIVDEAK VIQARAIVMP LPVRGGGGSV FGRTLETVLA ERPCRVIIES GRGRRREATR QAAEIV
|
| |