Gene Information |
Locus tag | Caci_3199 |
Symbol | |
ID | 8334552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 3526968 |
End bp | 3528266 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644956344 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003113947 |
Protein GI | 256392383 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
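The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length(3526968, 3528266)
print(length)                  # 1299
print(protein_length(length))  # 432
```

Both values match the Gene Length (1299 bp) and Protein Length (432 aa) fields of this record.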
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0389134 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCGCTCG ATCGTCTTAC TGCTGCGCAT GCTGGGCCGT CGGCTTCGGG AGGGCTGACG GCGGCGGGTA AACGGTTCCG GTTCTGGGGG ACGGCCTACA CCTTCCTGAT CCTGCTGACC GGGACGAATC TGCCGACACC CCTGTACAAG GGCTACGAGG CGCGGTTCGG GTTCTCGCCG CTGACGCTGA CGCTGATCTT CACCGCCTAT GTCGCGGTCC TGATTCCCTC GCTGTTGGTG GTCGGGCCGG CTGCCGACGC GATCGGGTAT CGCGTCATGC TGCTGCCTGC GCTGTTCGTG GCGGCGGGTG GGGCGCTGGT GTTCGCGTTC GCGTCCGGAA CCGGGTGGCT GTTCGCGGGG CGCATCTTGC AGGGCGTCGC GATCGGGGCG GCGACCGGAC CGCTGACGGC GACGCTGACC GAGCTCGAAC CGCACGGCGA CCGGCGCAAG GCGGCGTTGG TCTCGACGGT GGCCACGGCC GGAGGACTCG GACTCGGGCC GCTGCTGGCG GGTTTCCTCG CGCAGTACGC GCCGGCGCCG CGCGTGCTGC CGTTCGCGCT GGAGATCGGG CTGCTGGTCC CGGCGGTGGC ACTGGTGCTG ACGCTGCCGG CGAACCGTGC CCGCACGCGG TGGCGCCCGC GCCGTCCGGA GATCCCGGCC GCCCTGCGTT CGGAGTTCGC GACGAGCGGG ACGGCGTGCT TCGCAGCGTT CGCGGTGGTG GGACTGTTCC TGACGCTGAT TCCGACCTAC GTCGCGACGC TGTCCGGGAG CAAGAACCTG CTCCTCGGCG GCGCGGCGGT GGCGCTGATG CTGGCGTGCT CGGCGATCGC GCAGGTAGTC GGGTACGGGA AATCGGCGCG CGGGCTGGAG ATCGCAGGGC TTCCGCTACT GGCGGTGGGG CTGGTGTCGC TGGCGATAGC AGGGAACGTG TCGTCGCTGG CGCTGTTGCT CGGCGCGACA GTCGTAGCCG GAGCGGGGCA GGGACTGACG TTCCTCGGCG GGCTGACGGC GATCAACGCG GTGGCGCCGG CGGATCGGCG AGCCGATGTG CTGTCGAGCT TCTTCGTGAT CCTCTATTTG GGCGTCGGCG TGCCGGTGGT GGGAGTGGGC TTCGTCGCGA CGCAGGTGGG TTTGCTGGCG GCGGTTCAGT ACTTCGCGTG GGGTGCGGCG GTGTTGTGCG TGGTGGTGCT GGCGGTGCTG GGGCGCAGAC GTACACGCGA AGGGGATGGG AAAACGGGGC TGGCGGTCGA GGCAGCTGAG GCTGGAACCG CCGCGCGGCG TCGGTTGACC GACCGGTAG
Protein sequence | MALDRLTAAH AGPSASGGLT AAGKRFRFWG TAYTFLILLT GTNLPTPLYK GYEARFGFSP LTLTLIFTAY VAVLIPSLLV VGPAADAIGY RVMLLPALFV AAGGALVFAF ASGTGWLFAG RILQGVAIGA ATGPLTATLT ELEPHGDRRK AALVSTVATA GGLGLGPLLA GFLAQYAPAP RVLPFALEIG LLVPAVALVL TLPANRARTR WRPRRPEIPA ALRSEFATSG TACFAAFAVV GLFLTLIPTY VATLSGSKNL LLGGAAVALM LACSAIAQVV GYGKSARGLE IAGLPLLAVG LVSLAIAGNV SSLALLLGAT VVAGAGQGLT FLGGLTAINA VAPADRRADV LSSFFVILYL GVGVPVVGVG FVATQVGLLA AVQYFAWGAA VLCVVVLAVL GRRRTREGDG KTGLAVEAAE AGTAARRRLT DR
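The relationship between the gene sequence, the GC content field, and the protein sequence can be illustrated with a short sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAG even though the gene lies on the minus strand) and that the 10-character blocks are separated only by whitespace. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, so the standard codon-to-amino-acid mapping is used here; an alternative start codon such as GTG would need special handling that this sketch omits. The helper names are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene sequence listed above.
prefix = "ATGGCGCTCG ATCGTCTTAC T"
print(translate(prefix))         # MALDRLT
print(gc_content("ATGGCGCTCG"))  # 0.7
```

Applied to the full gene sequence, `translate` would be expected to reproduce the protein sequence once its spaces are removed, and `gc_content` should come out close to the 70% listed in the record.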