Gene Information |
Locus tag | Caci_4304 |
Symbol | |
ID | 8335658 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4883528 |
End bp | 4884556 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644957407 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_003115009 |
Protein GI | 256393445 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
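
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a terminal stop codon that is not counted in the protein length; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 4883528
END_BP = 4884556


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 1029, matching "Gene Length"
    print(protein_length(length))  # 342, matching "Protein Length"
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields.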
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0262518 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.00234368 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCTCGT TCGCCACCGT CCCGTACAGC CGCCGCGGAT TCCTCGGCCT GGTCGGCACC GCCGCGCTCG CCGCCGGCTG CGGCTCGGGC GCAGCCGGCT CGGCCAAGAA GACCACCAAA CTTCGCTACC AGGGCTCGGT GGGCACGGTC ACGCCGCCGG AACTCGCCGC AGACCTCGGA TATCTGGGCC CTGTGACGCT CGACTGGGTC GGCAACACCA CCAGCGGTCC GCAGGACATC CAGTCGGCGG CCACCGGGCA GACCGACTTC GGCGGAGCCT TCAACGGCGC GGTCGCCAAA CTGCACTCCG CCGGCGCCCC GATCACCGCC GTCATCAGCT ACTACGGCGT CGATCAGTAC TCCTACAACG GCTTCTACAC CCTGGAAGGC AGCCCGATCG CCTCCGCGCC CGACCTGTTC GGCAAGAAGG TCGGCATGAA CACCCTCGGC GCGCACTACG AGGCGGTCCT GGACATCTAC CTGAGCCGCA ACGGCGTCTC GGACAGCGAC GCCAAGAAGG TCGAGCCGCT GGTCGTGCCG CCGGTCAACA CCGAGCAGTC GCTGCGCGCG CACCAGATCG ACGTCGCCAC CCTCGGCGGC ATCCTGCGCG ACAAGGCGCT CGCCGACGGC GGCGTGAAGC AGCTGTTCAC CGACTACCAA CTGCTCGGCA CGTTCAGCGC CGGGACGTAC GTCTTCCGCA ACGACTTCCT GGCGAAGAAC CCTGACACGG TCCACGCCTT CACCTCCGGC GTCGGCAAGG CGATCGAGTG GGCCCGCACC ACACCCCTGC CGGAGGTCGT CGACCGCTTC ACGAAGATCA TCAAAGCCCG CGGCCGCAAC GAGGACACCT CGACCCTGAA GTACTTCAAG TCCTACGGGA TCGCCGGCAC CGGCGGCGTC GTCGCCGCCA AGGAATTCGA CACCTGGATC ACCTGGCTCG AACAGCAGGG CCAGATCCCC AAGGGCAAGG TCAAGGCCAC CGATGTCTAC ACGAACAAGT ACAACTCCTT CGCCAACGGC GGCAGCTGA
|
Protein sequence | MSSFATVPYS RRGFLGLVGT AALAAGCGSG AAGSAKKTTK LRYQGSVGTV TPPELAADLG YLGPVTLDWV GNTTSGPQDI QSAATGQTDF GGAFNGAVAK LHSAGAPITA VISYYGVDQY SYNGFYTLEG SPIASAPDLF GKKVGMNTLG AHYEAVLDIY LSRNGVSDSD AKKVEPLVVP PVNTEQSLRA HQIDVATLGG ILRDKALADG GVKQLFTDYQ LLGTFSAGTY VFRNDFLAKN PDTVHAFTSG VGKAIEWART TPLPEVVDRF TKIIKARGRN EDTSTLKYFK SYGIAGTGGV VAAKEFDTWI TWLEQQGQIP KGKVKATDVY TNKYNSFANG GS
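
The sequences above are printed in space-separated blocks of ten. A minimal sketch of how one might strip that spacing, compute GC content, and translate the gene sequence follows. It uses only the Python standard library, and the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons it allows, so the sketch uses the standard codon layout and does not handle alternative start codons (this gene begins with ATG).

```python
from itertools import product

# Standard codon layout in TCAG order; table 11 uses the same
# amino acid assignments for codons within the reading frame.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence from the record.
    head = "ATGAGCTCGT TCGCCACCGT CCCGTACAGC"
    print(translate(head))  # MSSFATVPYS, the first block of the protein
```

Passing the full Gene sequence field to `translate` and `gc_fraction` should allow a comparison against the Protein sequence and GC content fields of the record.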