Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_3759 |
Symbol | |
ID | 8335112 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 4244838 |
End bp | 4246115 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 644956899 |
Product | protein of unknown function DUF201 |
Protein accession | YP_003114502 |
Protein GI | 256392938 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0439] Biotin carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGAACC TGTTATTCGT CGGCGGTGCG CGTCCGGTGC CGATCAGCAC GGTCATGGCC GAGCAGGCCC TGGCCCAGGC ACGGTCCAGG GGGATCCGCA CGCACGTCGT GAACCGGCCC GACGCCCTGG CCGGGACCCC GACCGTCAGC GCCGCCGCGG ACGCGGTCTC GGCCGTCGAC TTCGTGCCCT CGGAGCAGAC CGTGGCCTGG GCGCGGTCCC GGGCTCGCGA CGGCGAACGG TTCGACGCGG TGTTCGCGCT GCAGGAGATG GCGCAGGTCG CCGTCGCCGA GGTGGCCCGG GCGCTGGGGG CGCCGGGCAA CGAGCCCGAG GCCGTGCGCC GGATCCGTAC CAAGGACCTG TGCCGTGCGG CGCTGGCCGA GGCGGGCTTC GCGCAGCCGG CGGTGCGGCT GTGTGCTGAT GTCCGGGCGG CAGCCGATTT CCTGGAGGAG CTTTCGAAGT CCTCGAAGCC GGGGCCGTGG GTCGTCAAAC CCCGCGACGC GATGGGAAGC ATCGGGGTCA GCCTCGTGCG CGGGATCGCC GACCTGCCCG GCGCGGTCGC GGCGCTGCCG GACGAAAGCC CTTTCCTGAT CGAGGAGTTC GTCGAGGGTC CCGAGTTCAG CGTCGAAGGG GTCTTCCTCG GCGGCGAGCC GCGGATCCTG GCCGTCACCG CGAAGGAGAA GGCGCCGCCG CCGTTCTTCG TCGAGGTCGG CCACGTGCTG CCGGCCGAGA TCTCCGAGAC CGAGCACGAC CGGATCCGGG ACCGGGTGGC CGCGGCCCTG TCCACGCTGG GCCTGCGCAC CGGCGCCTTC CACGTCGAGC TGTGGCTCAC CGCCGACGGC CCGGTGCTCG GCGAGGTGCA CGGCCGCTTC GGCGGCGACT GGATCCACAC CATGCTGCAG CACGCGATCC CGGACCTGGA GGTCTTCGGG CTCGTCTTCG AGGACATGCT GGGCCTGCCC GGCACGCACA CTTCCCTGGA GCCGACGCGC GGCGCCGCCG TCCGCTACCT GGTTCCGCCG CCGGGCCAGG TGACCGCGAT CGAGGGGTGG GAGGAGGTGC TGGCGCATCC CGCGGTCCTG CACGCGCAAC TACTGGTCGC CCCCGGCGAC ATCATCAAAC CCCTGCGACA GTCCTCCGAC CGGGCCGGCT TCGTGGTCGT CGGCGCGGAT GACCCCGCAC TGGCGCGCAA ACTGGCGACC GAACTCGTGG ACTCGGTGCG CTTCACGGTC CAGGACGCGC CGGCCGACCG TCTGCCGGGG CTGTGGTCGC TCACATGA
|
Protein sequence | MKNLLFVGGA RPVPISTVMA EQALAQARSR GIRTHVVNRP DALAGTPTVS AAADAVSAVD FVPSEQTVAW ARSRARDGER FDAVFALQEM AQVAVAEVAR ALGAPGNEPE AVRRIRTKDL CRAALAEAGF AQPAVRLCAD VRAAADFLEE LSKSSKPGPW VVKPRDAMGS IGVSLVRGIA DLPGAVAALP DESPFLIEEF VEGPEFSVEG VFLGGEPRIL AVTAKEKAPP PFFVEVGHVL PAEISETEHD RIRDRVAAAL STLGLRTGAF HVELWLTADG PVLGEVHGRF GGDWIHTMLQ HAIPDLEVFG LVFEDMLGLP GTHTSLEPTR GAAVRYLVPP PGQVTAIEGW EEVLAHPAVL HAQLLVAPGD IIKPLRQSSD RAGFVVVGAD DPALARKLAT ELVDSVRFTV QDAPADRLPG LWSLT
|
| |