Gene Information |
Locus tag | Caci_4413 |
Symbol | |
ID | 8335767 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 5009890 |
End bp | 5012895 |
Gene Length | 3006 bp |
Protein Length | 1001 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 644957516 |
Product | hypothetical protein |
Protein accession | YP_003115118 |
Protein GI | 256393554 |
COG category | |
COG ID | |
TIGRFAM ID | |
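The length fields in this record can be cross-checked against the coordinates. The sketch below is a minimal consistency check, not part of the source record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS includes a single terminal stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 5009890
end_bp = 5012895
protein_length_aa = 1001

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(gene_length_bp, codons, expected_protein_aa)
```

Under these assumptions the computed values agree with the reported Gene Length (3006 bp) and Protein Length (1001 aa).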
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.107234 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGTGG ATCCCGAGGA CAGGTTCGAT CTGCCGCGAA AGTTCCAGCA CGGAGTCATC CGTCGGGAGC GCTGCGATCG GTCTGTCCTG GAGGCGGTTT CGACCGCCGG GTCCCCAGAT CCGCAGGACA TTGCGACGTG GCGGAGCATC GATCGGCTCG TCGCGCAGGA CGGGCCGGCG GCGGCTGTGC GCTTCTTCGC CGAGCTGAGC GCCGAGGACG TGAGCTTCAG CCGGCAGAGC TGGAGTCGCG GCAGCGTGTA TGACAGCACG CTCCGCGACC GTCGCTGGCC TTTGATGCTG CGGCTGCTCG ATCTGGTGCG CTCGGGCACG GACGAGGAGT ACGGCGAGGC GATCGCGACC GCGGAGAAGC TGCGGGACGA GTTCCCGGTC CCGGCGTCGC GGGCGGCGAT GGCGGTCCTC GCCCTCGACC GGCCGGAGTG GTGCTCGGCG GACATCGCCG ACTACGCCGA GGACGGCGAG GGCAGCGAGG ACGGCGAGGG CAGCGAGGAC GGCGACGACG AGCACCACGT CGCCAGGCTC CTGATGCTCG CGGTGACGAC CCGGGAGCAG GCGCGGGAGC TGGGGCGGAA GATCCGGCAC TGGGAGTGGG TCAACGACGA CCGGCGCGCC GAGGCGACGT TCCTGGCCAC GATCGGCGCC GGCGCCGACG AATTCCTCGT CGGCTGCTTG GAGGGGAGCC GCTGGCACGC GCTCAACCTG CTGCCGAAGC TGCCGAGCGA GCTGGCTGTC AACACCCTGA TCGGGGAGCT GGGTCCGAAG GACGCGCAGG CTGCGCTGCT GGAGGCGGCG AAGCGGTTCC CGCGGCGGGT GCTGCGGTTG GTCGCCGAGG CTGAGCCGGC GACGCAGGTG GACTATGTGC TGCGGCTGCA TGTCGCGGCC GATCCCGCGT TGGCGCGGGA GGAGTTGGGC CGGCTTTCGG AGGCGGCGCG GCGGCGGGTG GAGGCTTTGC TGGGTCCGTC GGGGGTTGCC GGGTCCGGCG GCGGAGGCGG CGGGGGTACT GACGGCGCCG GCGGTGGTTC GGTTCCGGTG GCCGAGGAGG GTGACCTGCC GCGCGTCCTG GTGGTACCAC CGTGGGCGGA TCGTGAGGCG GGCAAACCGA TCGCGCTGAA GAACCTGCCG ACGCCGGTGC CGTTCTTCCG GTGGCCCGCG CGGGAGACCG CCGGACCGAC CGCGATTCCC GAGCCGGCGC CGTCGCCGGA CGAGGCGGGG AATCAGAACA AGATCGTGCT GACCATGGCG GAGTGGCTGA ACTCCGAATA CGAGCCGTAC TCGGAGGCCG CCGACGTCTT CTTCCGCCGC TATCCCGGGC AGGCGGTGCG TCAGCTGCTG GTCTGTGCGC TCGGGACGGC CGGGCCGGAG CGTCGCGTCG CCGAGGCCGC GGTGCGTTTC GTGGCGCGGC AGGGCTGGGC CGACGTCGTG GCGCTCGCCG GGGAGTCGAG CCCGGAGGCG GCGGTCGCGA CGCGGGCGCT GCTGGAGCGG CGCGGGCTGG ACACGTTTCC GAAGTCGATG CCGGCGGTGC CCTTATGGGC CGATCCGGCT TCGCTGCCTG GGATTGTGCT GGCGGGGCGG CGCGCGGTGT TGCCGGCGTC GGCGGTGCGG GTCGTGGTGC AGATGCTGAC GATCTCCTCG CGCCGGGAGG TCGCACCGTA TGGCGGGATC GAGATCGTGA AAGAGGCCTG CGATGCACGG TCGCTGGCGG AGTTCGCGTG GGGGCTGTTC GAGAACTGGG CCGGGGCGGG GTTCCCGACG AAGAAGCTCG GATGGGCCTT CGACACGCTG 
TTGTGGTTCG GGGACGGCGA GACCGCGCGG CGGCTCGCGC CGCTGGTGCG GGCGTGGCCG GGCGAGGGCG GGTCGGCGCG GGCGGCGGGC GGGCTGGACG TGCTGATCGC CGCGGGCGGG GAGGCCGGGC TGCGCGAGGT CTACGACATA TCGCAGCGTT CGACGTTCGC CGCGCTGCGG GCCGAGGCCT CGCGGCGGGT CGCCAAGGCG GCTTCGGCGC GAGGGCTGAG CGCCGACCAG TTGGAGGACC AGCTGGTCCT GGATCTGGGC GCCGGGCGAG ACGGGACGCT GGAGGTGGAC TTCGGGGCGC GGCGGTTCAC GGTCGGGTTC GACGAGTATC TGGCGCCGTT CGTCACCGAC GGCGCGGGCA AGCGCCGGGC GTCGCTGCCG AAGCCGACGG CCAAGGACGA CGCGGTACTG GCGGCTGCGG CGCAGCGGCG GTTCACCGAG CTGAAGAAGG ACGCGAAGAC GTTCGCGGTC CGGCAGGCGG CGCGGCTGGA GCAGGCGATG GTGGGCGGGC GGCGCTGGTC GGAGGGCGAG TTCCAGGCGG TGTTCGTCCA GCATCCGCTG CTGCGGTTGC TCGGCCGGCG GCTGGTGTGG GGCGAGTTCG ATGGTGAGGG GGCGCTGCGT GCGGCGTTCC GGATCGCGGA GGACGGGACG TTCGCGGACG TCGCCGACGA GCGCTTCGTG CTCAACTCAA GAGAGGCCGG CAGCTGCGGC ACGATCGGCG TCGTGCACCC CCTCCACCTG GGCGCCGACC TCCCCCGCTG GGCCGAGATC TGCCACGACT ACGAGATCAT CCAACCCTTC CCCCAAATCG GCCGCCCCTT CTTCACCCTG ACCCCCGCCG AGCGCACAGC GACCCGCCTG GACCGCTTCT GCGACGCCGA CCTCGCCACC GACCGCTTCC TCGCCCTGCA CCGCAGCCCC GGCTGGGGCG GCGCCGCCCT CTGGGACTCC GGCAGCACCG TGGCGACCGG CCGCCGCCTC CCCCACCACC GCATCCTGAT CGTCCGCATC ACCCCCGGCT ACCGCGAAGG CCGCCTGGCC GACACCCCCC GCCAAACCAT CACCGACATC TGGCTCAGCC CCACCGACAG CCACGGCCGC CGAGGCGCCC GCACCGAAGC CCTCCCCCTC GGCGGCATCG AACCGCTCAC CGCCAGCGAG ATCATCGCCG ACTTGGCGGC GGCGGTGGGG ACGTGA
|
Protein sequence | MSVDPEDRFD LPRKFQHGVI RRERCDRSVL EAVSTAGSPD PQDIATWRSI DRLVAQDGPA AAVRFFAELS AEDVSFSRQS WSRGSVYDST LRDRRWPLML RLLDLVRSGT DEEYGEAIAT AEKLRDEFPV PASRAAMAVL ALDRPEWCSA DIADYAEDGE GSEDGEGSED GDDEHHVARL LMLAVTTREQ ARELGRKIRH WEWVNDDRRA EATFLATIGA GADEFLVGCL EGSRWHALNL LPKLPSELAV NTLIGELGPK DAQAALLEAA KRFPRRVLRL VAEAEPATQV DYVLRLHVAA DPALAREELG RLSEAARRRV EALLGPSGVA GSGGGGGGGT DGAGGGSVPV AEEGDLPRVL VVPPWADREA GKPIALKNLP TPVPFFRWPA RETAGPTAIP EPAPSPDEAG NQNKIVLTMA EWLNSEYEPY SEAADVFFRR YPGQAVRQLL VCALGTAGPE RRVAEAAVRF VARQGWADVV ALAGESSPEA AVATRALLER RGLDTFPKSM PAVPLWADPA SLPGIVLAGR RAVLPASAVR VVVQMLTISS RREVAPYGGI EIVKEACDAR SLAEFAWGLF ENWAGAGFPT KKLGWAFDTL LWFGDGETAR RLAPLVRAWP GEGGSARAAG GLDVLIAAGG EAGLREVYDI SQRSTFAALR AEASRRVAKA ASARGLSADQ LEDQLVLDLG AGRDGTLEVD FGARRFTVGF DEYLAPFVTD GAGKRRASLP KPTAKDDAVL AAAAQRRFTE LKKDAKTFAV RQAARLEQAM VGGRRWSEGE FQAVFVQHPL LRLLGRRLVW GEFDGEGALR AAFRIAEDGT FADVADERFV LNSREAGSCG TIGVVHPLHL GADLPRWAEI CHDYEIIQPF PQIGRPFFTL TPAERTATRL DRFCDADLAT DRFLALHRSP GWGGAALWDS GSTVATGRRL PHHRILIVRI TPGYREGRLA DTPRQTITDI WLSPTDSHGR RGARTEALPL GGIEPLTASE IIADLAAAVG T
|
| |