Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4180 |
Symbol | |
ID | 5703968 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4748147 |
End bp | 4749187 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641273607 |
Product | oligopeptide/dipeptide ABC transporter, ATPase subunit |
Protein accession | YP_001538960 |
Protein GI | 159039707 |
COG category | [R] General function prediction only |
COG ID | [COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase |
TIGRFAM ID | [TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00217301 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCGCGC CCGCTAGCCC ATCGAAAAAG GTACGCGGCG AGCCGATCCT CGCCGTCGAG AACCTGGTCA AACACTTCCC GATCACCCGG GGCGTCGTCT TCCAACGGCA GATCGGCGCG GTACGGGCCG TGGACGGGAT CAGTTTCGAC CTGCGCCGCG GCGAAACCCT CGGCATCGTC GGCGAGTCCG GCTGCGGTAA GTCCACCCTG GCCCGGCTGC TGATGCGACT GGAGACGCCG ACCTCCGGAC GAGCGACCCT GGAAGGCCGT GACCTGTTCG CCGCCCGCGG CACCGAACTG CGCCGGCTGC GCCGCAACAT GCAGATGGTC CTCCAGGATC CGTACACCTC GCTGAACCCG CGGATGACCG TCGGCGACAT CATCGGTGAA CCGTTCGAGA TTCACCCAGA GGCAGCGCCC AAGGGCAGCC GGCAGCGGCG GGTCCAGGAA CTGCTGGACA TGGTCGGGCT CAACCCCGAG CACATCAACC GGTACCCGCA CCAGTTCTCC GGCGGCCAGC GGCAGCGCAT CGGCATCGCC CGCGCGCTCG CCCTGCGCCC CGAGGTGATC GTCTGCGACG AGCCGGTGTC CGCGCTGGAC GTGTCGATCC AGGCGCAGGT GATCAACCTG CTGGAGCAGC TCCAGGACGA GCTCGGCCTC TCGTACATCT TCATCGCCCA CGACCTGTCG GTGGTACGGC ACATCTGCGA CCGGGTCGCG GTGATGTACC TGGGCCGAAT CGTGGAAATG GGCACCGAGG AGGAGATCTA CCAGCGGGCC ACCCACCCGT ACACCCAGGC CCTGCTGTCC GCGGTGCCGG TGCCCGACCC GGAACAGCGG AACAACCAGA ACATGATCCG GCTCGTCGGC GACGTACCGA GCCCGGCGAA CCCGCCGAGC GGCTGCCGGT TCCGCACCCG GTGCTGGAAG GCACAGGACA TCTGTGCCAC CCAGGACCCG GACGCCGTTC CCCGCGCCGC CGACCCGCAC CCGTCGGCCT GCCACTTCGC CGAACTCCGC GAACCGGCGT CCCCCTCCTG A
|
Protein sequence | MTAPASPSKK VRGEPILAVE NLVKHFPITR GVVFQRQIGA VRAVDGISFD LRRGETLGIV GESGCGKSTL ARLLMRLETP TSGRATLEGR DLFAARGTEL RRLRRNMQMV LQDPYTSLNP RMTVGDIIGE PFEIHPEAAP KGSRQRRVQE LLDMVGLNPE HINRYPHQFS GGQRQRIGIA RALALRPEVI VCDEPVSALD VSIQAQVINL LEQLQDELGL SYIFIAHDLS VVRHICDRVA VMYLGRIVEM GTEEEIYQRA THPYTQALLS AVPVPDPEQR NNQNMIRLVG DVPSPANPPS GCRFRTRCWK AQDICATQDP DAVPRAADPH PSACHFAELR EPASPS
|
| |