Gene Information |
Locus tag | Caci_3922 |
Symbol | |
ID | 8335275 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4453722 |
End bp | 4454762 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644957046 |
Product | ABC-type nitrate/sulfonate/bicarbonate transport systems periplasmic components-like protein |
Protein accession | YP_003114649 |
Protein GI | 256393085 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
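The length fields above are consistent with the coordinates if Start bp and End bp are read as 1-based, inclusive positions and the span includes the stop codon. Both points are assumptions inferred from the numbers in this record, not something the record states. A minimal sketch of that arithmetic, using a hypothetical helper named `cds_lengths`:

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa).

    Assumes 1-based inclusive coordinates and a span that ends
    with a stop codon (assumptions, not stated by the record).
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(4453722, 4454762)
print(gene_len, protein_len)  # 1041 346
```

Under these assumptions the result matches the Gene Length (1041 bp) and Protein Length (346 aa) fields.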
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.679528 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.204471 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGAACCC AGACTCGCTC CACCCGCATA TCCCACGCCA TCGCTCTGGC TACGGCCGTC GTGCTCGGCG CGGGGCTGCT CACCGCGTGC GGCAGCTCCT CGTCCTCGTC GGGCTCAGGC GGCGACACGG TCACCGTCGG CGTCAGCAAC AACATCTTCG ACGTGCCGAT TCGGCTGGCC GACTCCAACG GCTACTTCGC CAAACAGGGC CTGAAGGTCA AGTACGTGAC CATCACCGCC TCGACCGGAT CCTCAGCTCT GCAGTCGGGA TCCGTCCAGT TTCTCAACGA CAGCCCGACC GCCTTCCTGT CCGCCATCAG CAAGGGCATC CCGCAGACCG CGATCGCCGC GAACGCCGGC GGGAACCCGC TCGGCCTCAT CGTCAGCACG AAGTTCGCCA AGGCGCACCA GCTGACCGCT GACAGCACCG CGGACCAGGC CGCCGCAGCC CTGGCCGGCT CCACCGGCGG CGCCAGCTCG GCCAACACCA AGGGCGAGGC GAGCATCTAC CTCAAGAAGT ACGGCGTCGA CCCTGGCAAA GTGAAATGGG TCTCCCTGCC GAGCCCGACC GCCGACAACG CGGCCCTGAA GAGCGGCCAG ATCGACTGGT TCGTCACCTC CGAGCCGGCT CCGCTGCAGA TCCAGGAGAC CGGCGACGGC ATCGTCGTCG CGGACTCGAC CAAGGTGCCC GAGTGGTCCT CGGCGCAGGC CGGATACGGG CAGTTCGTCG TCGCCAGCAA CAGCTACCTC AGCCAGCACG CCGCCACCGC GAAGAAGTTC GTCACCGCGG TGCAGCAGGC CACGGCATAC ATGAACGCGA ACGTCGTCTC GCCCCCCGTG CTGACCGCGA CCCAGGCCGC GTTGCCCGGA GTGTCCGCCA CAGCGCTGCA GGCCAGTCTC CGGCAGGTCG AGTGGCCGGT CAGCGAGGCG ATGAGCCCTG AGGGCTGGAC CAAGGTCCTG GCCTTCATCA ACTCCCTCGG CGCGGTGTCC CAGAAGGCCG TCATCTCAGA CAGCGACTGG ACCAACAAGT ACCTGCAGTA G
|
Protein sequence | MRTQTRSTRI SHAIALATAV VLGAGLLTAC GSSSSSSGSG GDTVTVGVSN NIFDVPIRLA DSNGYFAKQG LKVKYVTITA STGSSALQSG SVQFLNDSPT AFLSAISKGI PQTAIAANAG GNPLGLIVST KFAKAHQLTA DSTADQAAAA LAGSTGGASS ANTKGEASIY LKKYGVDPGK VKWVSLPSPT ADNAALKSGQ IDWFVTSEPA PLQIQETGDG IVVADSTKVP EWSSAQAGYG QFVVASNSYL SQHAATAKKF VTAVQQATAY MNANVVSPPV LTATQAALPG VSATALQASL RQVEWPVSEA MSPEGWTKVL AFINSLGAVS QKAVISDSDW TNKYLQ
|
| |
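The protein sequence can be related to the gene sequence by translating it codon by codon. The sketch below is an illustration and not the database's own pipeline: the `translate` function is hypothetical, it uses the standard codon assignments (which translation table 11 shares for elongation codons), and it assumes the first codon, here GTG, is read as methionine because it acts as a start codon. Whitespace is stripped first, since the sequences above are printed in blocks of ten characters.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 also uses for non-start codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a CDS, stopping at the first stop codon.

    Assumption: the first codon is a start codon and is emitted as 'M'
    regardless of its usual meaning (e.g. GTG, normally valine).
    """
    seq = "".join(dna.split()).upper()  # drop the 10-character block spacing
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append("M" if pos == 0 else aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("GTGAGAACCC AGACTCGCTC CACCCGCATA"))  # MRTQTRSTRI
print(translate("TACCTGCAGT AG"))  # MLQ (first codon forced to 'M')
```

The first call reproduces the opening ten residues of the protein sequence. The second call shows a limitation of the start-codon assumption: it only makes sense when the input begins at the true start of the CDS, so a fragment such as the final 12 bases yields `MLQ` instead of the record's closing `YLQ`.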