Gene Information |
Locus tag | Caci_4031 |
Symbol | |
ID | 8335384 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4562127 |
End bp | 4563455 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644957137 |
Product | type II secretion system protein E |
Protein accession | YP_003114740 |
Protein GI | 256393176 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4962] Flp pilus assembly protein, ATPase CpaF |
TIGRFAM ID | |
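The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Caci_4031
length_bp = gene_length(4562127, 4563455)
print(length_bp, protein_length(length_bp))
```

Under those assumptions the record is internally consistent: the coordinates give 1329 bp, and 1329 / 3 = 443 codons, one of which is the stop codon, leaving 442 residues.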
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.148527 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.00766622 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGAGCGTCG CGACCTTCGA ACGTGAGGCG CTGACCGATG ACGAGCAGCA CCTGGTGCGC AAGCTGCGCG CCTCGGTCGG CGCCCGGCTG GCCGAGCGCG TCGGCGACAA CTCCGGGAGC GTCGAGCGGG AGCGGGTCGG TCGGGAGCTG ATCGACGACG CGTTGATGGC CCACGCCCGC GCCGGGCTGG CCGGCGGCGG CCGGGACGTG TTGGACGGCC AGGCCGAGGC CAAGGTGAGC CGGGCGGTGT TTGACGCGCT GTTCGGCATG GGCGGTCTGC AGCGTTGGCT GGACGACCCG TCGGTGGAGA ACATCACGGC CAATGGCTTC GACCATGTCT TCGTCCACTA CTCCGATGGC AGCAAGGTCC CGGTTGGGCC GCTGGCCGCG TCCGAGGAGG ACTTCGTCGA CCTGCTGCGT ACGATCGCGG CACGGGCCGG CACCGAGGAG CGGATCTTCG ACCGTGCGCA TCCGCAGCTG AACATCCAGC TGCCCGACGG GTCCCGGCTG TTCGCGGTGA TGGCAGTGTC CCGCCGGGTG TCGCTGACCG TGCGCAAGCA CCGCCACATG GCCACCTCGC TGCGGGAGCT CGCGGCCCTG GGCATGTTCG ACCTCGACAT CGCCGCACAG CTGGAGGCCG CGGTACGGGC CAAGCTGAAC ATCCTCATCT GCGGCGCCAT GGGCGGCGGC AAGACCACGG TGCTGCGCGG CCTGGCCGCG TGCATCGGAC CGGAAGAGCG GCTGGTCACC ATCGAGGACA CCTACGAACT CGGGCTGGAG CAGGCGCACC CCGACGTGGT GGCGATGCAG GCCCGCGAGG GGAACCTGGA GGGCCAGGGC GCCGTGTCGC AAGCCGAGCT GGTGCGGATG TCGCTACGGA TGAACGCCTC CCGCGTCATC GTCGGCGAGG TCCGCGGCGA GGAACTGGTG CCCATGCTCA ACGCCATGAC CATGGGAACC GACGGCTCGC TGGGCACCAT CCACGCCTCG TCGTCCAAGC AGGCGTTCGA CAAGATGGCC ACCTACGCCA TCCAGTCACC GGAGCGCCTC GACCGCGCGG CGACGAACCT GCTGGTCGGC ACCGCCCTGC ACGTCGTGAT CCAGCTGGGT CGGCTGCGCG ACGGCACCCG CGTACTGTCC TCGATCCGGG AGATCACCGG CGTCGGGGAC AACGGCGAGG TCACCAGCAA CGAGGTCTAC AAGCCCGGCC GCGACGGCCA GGCCGTACCC GGAACCGGCT GGACCGCCGG CACCGCGCAG CGCCTGATCG ACGCAGGGCT GGATGAGGAT GTGCTCACGC GTTCGGCGCG CGCGGGGTGG TCGATATGA
Protein sequence | MSVATFEREA LTDDEQHLVR KLRASVGARL AERVGDNSGS VERERVGREL IDDALMAHAR AGLAGGGRDV LDGQAEAKVS RAVFDALFGM GGLQRWLDDP SVENITANGF DHVFVHYSDG SKVPVGPLAA SEEDFVDLLR TIAARAGTEE RIFDRAHPQL NIQLPDGSRL FAVMAVSRRV SLTVRKHRHM ATSLRELAAL GMFDLDIAAQ LEAAVRAKLN ILICGAMGGG KTTVLRGLAA CIGPEERLVT IEDTYELGLE QAHPDVVAMQ AREGNLEGQG AVSQAELVRM SLRMNASRVI VGEVRGEELV PMLNAMTMGT DGSLGTIHAS SSKQAFDKMA TYAIQSPERL DRAATNLLVG TALHVVIQLG RLRDGTRVLS SIREITGVGD NGEVTSNEVY KPGRDGQAVP GTGWTAGTAQ RLIDAGLDED VLTRSARAGW SI
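The relationship between the two sequence fields can be illustrated with a short translation routine. This is a sketch, not the database's own pipeline: it assumes the gene sequence is given 5' to 3' on the coding strand, strips the 10-base grouping spaces, and uses the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which do not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence in the record
print(translate("ATGAGCGTCG CGACCTTCGA ACGTGAGGCG"))
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field once the spaces in the latter are removed; the fragment above corresponds to the first ten residues, MSVATFEREA.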