Gene Information |
Locus tag | Caci_4938 |
Symbol | |
ID | 8336292 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 5636427 |
End bp | 5638103 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644958037 |
Product | ABC-type sugar transport system periplasmic component |
Protein accession | YP_003115639 |
Protein GI | 256394075 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
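The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names (`gene_length`, `protein_length`, `gc_percent`) are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 5636427
END_BP = 5638103


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; spaces used to group the sequence are ignored."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


print(gene_length(START_BP, END_BP))                  # 1677
print(protein_length(gene_length(START_BP, END_BP)))  # 558
```

Passing the gene sequence from the Sequence section to `gc_percent` would give a value to compare against the GC content field.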
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000971011 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAGCA CCAACGCCTT CACGCGCCGC GGATTCCTCG CGGCCTCCGC CGGCGCCGCC GGCGCGATCG GGCTGTCCCC GCTGCTGGCG GCCTGCGGCA ACAACGGCGG CAAGGGCGGG GCGAGCACCA AGGCGGCGAT CCAGGCCGTG CTGCCGACGT ACAAGCCGCT GTCCGGCGGG GTCACCCCCG ACATCCCGTC CGTGCCGGGC ACGAACGGCG CCATGACCGA CCCGGGTTTC CTGAAGTACC CCAGCACCCT GCCGAAGACG GTGACCGGCC AGGTCGGCTC CGGCGGGAGG TACGCCGGCG TCGCGCCGTC GTGGAACCCG GTCCCGCCGG CCGGCAACTC CTACTACGAG GCGGTGAACA AGGCGCTCGG CGCGACCTTC GTCTCCCAGC CGGCCAACGG CAACACCTAC AACACCGTCA TCCCGCCGCT GATCGCCGCG GACAAGCTGC CGGACTGGCT GTCCATCCCG GGCTGGCTGA ACCCCACCTT CGACACCGGC GGCCTGGTCG GCACCAAGCT GGCCGACCTG ACGACCTACC TCGGGGGCGA CGCGGTCCTG GAGTACCCGA ATCTCGCGGC CATCCCCAGC GGCGGCTGGA AGTGCGGGAT CTGGAACAAC CGGCTCTACG GCATCCCGTC GCAGACCGAC AGCCTGAGCT TCGCCGGCGC CATCTACTAC CGCAAGGACC TGCTGGACGC CAAGGGCATC ACCCCGAACG TCAAGACCGC GCAGGACTTC GAGGCCCTCG GCCGGGAGAT CAACAACCCC GGCGGCGGCG TGTGGGCGTT CGACGACATG CTGGTGTACC TCTACCAGGT CTTCAAGGTG CCGCTGGGCG GGTGGTACCT GGAGAACGGC AAGATCAAGA ACGTCGGCGA GCACCCCGCC ATGCTGGAGT GCCTGGCCTG GGCCAACAAG ATCGCCAAGG CCGGGTTGGT CCACCCCGAC GCGATCGCCG GAGTGAACAC CAGCAACCCC AGCCGGTTCA TGGCCGGCAA GGTGTACATC GAGGCCGGCG GCATGGCCGG CCTGAGCGGC CCGGACGCGA AGAACGGCAC CGCGGGCAAG GCCGGCTACC AGCGCGCGCT GTTCCCGCTG TTCTCCTCCG ACGGTTCGAC CCCGAGTATC GGCCTGGGCG GCTCCTCGGG CTGGATGAGC TATCTGAACA AGAATCTGAA CCCGGAGCAG ATCAAGGAGT GCCTGCGGAT CGCGAACTTC TTCGCCGCGC CGTTCGGGTC CTTCGAGTAC AACCTCCTCA ACTACGGAGT CGAAGGCGTC CACTACACGA TGGGCCCTGA AGGACCGGTG TTCACCAAGG AGGGCTCCAA CACGGCGGCC GACGGCATAT TCGGCTTCTT CAGCACCGCT CAGACCGCGG TCTACAACGC GGGGTACCCC GACGTCACCA AGGCCATGGA GGCTTGGTGC GCCGACGCGG CCAAGCACGC CTACAAGCCG ATGTTCTGGA ACCTGAACAT CAGCGTGCCC AGCCAGTTCT CCAAAACCGC CGCCCAGACC GAGTTGTGGG ACGCGACGCA GGCGGTGGCG CACGGAAAGC AGCCGGTGTC GTACTACCAG GACGCGTACT CCCGGTGGAA GAGCGGCGGC GGCGACGCCC TGGGGACCTG GTACCAGCAG AACCTTATTG ACAAGGGCCT CAGCTAG
|
Protein sequence | MSSTNAFTRR GFLAASAGAA GAIGLSPLLA ACGNNGGKGG ASTKAAIQAV LPTYKPLSGG VTPDIPSVPG TNGAMTDPGF LKYPSTLPKT VTGQVGSGGR YAGVAPSWNP VPPAGNSYYE AVNKALGATF VSQPANGNTY NTVIPPLIAA DKLPDWLSIP GWLNPTFDTG GLVGTKLADL TTYLGGDAVL EYPNLAAIPS GGWKCGIWNN RLYGIPSQTD SLSFAGAIYY RKDLLDAKGI TPNVKTAQDF EALGREINNP GGGVWAFDDM LVYLYQVFKV PLGGWYLENG KIKNVGEHPA MLECLAWANK IAKAGLVHPD AIAGVNTSNP SRFMAGKVYI EAGGMAGLSG PDAKNGTAGK AGYQRALFPL FSSDGSTPSI GLGGSSGWMS YLNKNLNPEQ IKECLRIANF FAAPFGSFEY NLLNYGVEGV HYTMGPEGPV FTKEGSNTAA DGIFGFFSTA QTAVYNAGYP DVTKAMEAWC ADAAKHAYKP MFWNLNISVP SQFSKTAAQT ELWDATQAVA HGKQPVSYYQ DAYSRWKSGG GDALGTWYQQ NLIDKGLS
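The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own pipeline. It relies on two observations: the gene sequence as listed begins with ATG and ends with TAG, so it appears to be given in coding orientation even though the gene lies on the minus strand; and translation table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may serve as starts. The sketch does not handle alternative start codons such as GTG or TTG, which would be read as M at the first position. `translate` is a hypothetical helper name.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each codon string to its one-letter amino acid.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding-strand DNA sequence; stop codons appear as '*'."""
    dna = cds.replace(" ", "").upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First and last codons of the gene sequence listed above.
print(translate("ATGTCGAGCA CCAACGCCTT CACGCGCCGC"))  # MSSTNAFTRR
print(translate("GGCCTCAGCTAG"))                      # GLS*
```

Translating the full gene sequence and stripping the trailing `*` should yield the 558-residue protein sequence shown above.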