Gene Information |
Locus tag | Caci_5501 |
Symbol | |
ID | 8336861 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 6346615 |
End bp | 6348012 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644958605 |
Product | hypothetical protein |
Protein accession | YP_003116201 |
Protein GI | 256394637 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02678] conserved hypothetical protein TIGR02678 |
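The length fields above agree with the coordinates if the start and end positions are read as 1-based and inclusive, and if the CDS ends in a single stop codon. Both readings are assumptions made for this check, not statements from the record. A minimal Python sketch, with illustrative function names:

```python
# Coordinates and lengths copied from the record above.
START_BP = 6346615
END_BP = 6348012


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Length in aa, assuming an unspliced CDS with an optional stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


print(gene_length(START_BP, END_BP))                  # 1398
print(protein_length(gene_length(START_BP, END_BP)))  # 465
```

Under those assumptions the sketch gives 1398 bp and 465 aa, matching the Gene Length and Protein Length fields.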
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000679714 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.00404063 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGGCCGAGC ACCTTGCCGC AGCAGGAGTC ATTCCCAGCG ACCTTGGATC CTTCCAGCAT GCCGTACGCC TGGTGCTGAC CAATGATCTG ATCACTGCCG ATCGGCCGCG GTCCGGAAGT CTGGACGTGG TGCTTCGGTG GGCCGACCGG ATCTCCGTCG ATATGCAGGA GCTGTTCGCC TATACCCTCA TCGCGACCGC GCGCCAAGTA CGGCTGGTTC GTCGAAACGA CGTCCTGGAC CCGACCCAGC ACTTGATATT CACCAGCAGG TCCGGCCGTC AGTTCGACCG GCGGCGCCTG GCCTATCTGT GCCTGGTGCT CGCCGGCTTC CAGCGGTCGC GGATCGAGGT GTCACTCGTG GACATCGTCA AAGCAGTCAC CCCATTGGCG AACGCTCAGC CCAGCCTCGG GTTCGAACCG ACCATCACCG AACATCGCCG AGCCGTGGTG GACGTCCTGG ACTGGTTGAC CGATCGAGGG GCACTGCGGC TGTCGGACGG CTCACTGGAC GCGTGGGCGG GCGGCGACCG CGAAGGCGAC GCCCTGTACG ACATCGATCA CGATGTATGC GGTGCCTTGT TCCGGCCTCC CGCGGCACTG CAGCATGTGG CCAGCGCGGC GCAGCTCATG GAAGGTGCCG ACGGGCCGAA CAAGAGCGCC CGGCGGGAAG CCGCCACACG GCGCGCCCGC AGACTCTTGA TCGAACAACC TGTCGTTTAC TTCGAACGAT GCGAATCAGC TGTGGCCGCC GCGCTGCGGT CACCAGATCT GGCGGAGAAC CTGGCACGCT TGACCGGTCT GGTGGTCGAG CGTCGAGCCG AAGGCGTCAT GCTCGCAGAC CCGTCAGGAC GCTTCACGGA TCGGATCTTC CCGCTGAAGG GCGGCGCGGT GAATCGGACC GCCGGCCTTA TTCTGGGCGC GATCGCCAAC CTGCTCGAAG ACCCCAATGA GGCCCGAAGG CTGCCACGTC TGCCGGTCCC GACCCTCACC GAGGAGACGG CCGACCTAGT CACCCGCATC GACTCCGCAC GTCCGCTGCG CGACGACGAC CGACCGGGCC GGCCGCAGGC ATCAGTTACG GACGCGGAGC CGAATCAGCT AGGCCTGAAC GCGCCTTTCC TGTCGACCGC GCAGGTGGCC ACCATCGTCG ACGAGCTGTA CGTCGAGTTC GGCGCATCGT CGTTTACAGC GATCTGGCAG GGCGACCCCA CCGGACTCGC TCGGGCTGCC ACCAGATTCC TCGCAGACCT GGGACTCGTC CACGAAATCC CCGGTGGACT CCTCGTACTG CCCGCAGCCG CCCGCTACCG AAACATTCAA GGAGTTCTCC CTCAGCCCGC CCCTGACGGA TTGTTCCCCC TGGACTTCAC CGAGAACAAG GACGGAACCG ACGGATGA
Protein sequence | MAEHLAAAGV IPSDLGSFQH AVRLVLTNDL ITADRPRSGS LDVVLRWADR ISVDMQELFA YTLIATARQV RLVRRNDVLD PTQHLIFTSR SGRQFDRRRL AYLCLVLAGF QRSRIEVSLV DIVKAVTPLA NAQPSLGFEP TITEHRRAVV DVLDWLTDRG ALRLSDGSLD AWAGGDREGD ALYDIDHDVC GALFRPPAAL QHVASAAQLM EGADGPNKSA RREAATRRAR RLLIEQPVVY FERCESAVAA ALRSPDLAEN LARLTGLVVE RRAEGVMLAD PSGRFTDRIF PLKGGAVNRT AGLILGAIAN LLEDPNEARR LPRLPVPTLT EETADLVTRI DSARPLRDDD RPGRPQASVT DAEPNQLGLN APFLSTAQVA TIVDELYVEF GASSFTAIWQ GDPTGLARAA TRFLADLGLV HEIPGGLLVL PAAARYRNIQ GVLPQPAPDG LFPLDFTENK DGTDG
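The protein sequence can be reproduced from the gene sequence by removing the spaces between the 10-character blocks and translating codon by codon. The sketch below assumes the gene sequence is listed in coding orientation (it begins with ATG even though the gene is on the minus strand), so no reverse complement is applied. It uses the standard codon assignments, which translation table 11 shares for sense and stop codons; alternative start codons are not handled. The function names are illustrative and not part of any database tool.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and uppercase the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three and last two blocks of the gene sequence above.
print(translate("ATGGCCGAGC ACCTTGCCGC AGCAGGAGTC"))  # MAEHLAAAGV
print(translate("GACGGAACCG ACGGATGA"))               # DGTDG
```

The two printed fragments match the first ten and last five residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would check the complete protein and the GC content field in the same way.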