Gene Information |
Locus tag | Sare_4598 |
Symbol | |
ID | 5706619 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5216949 |
End bp | 5218433 |
Gene Length | 1485 bp |
Protein Length | 494 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641274002 |
Product | ABC transporter related |
Protein accession | YP_001539349 |
Protein GI | 159040096 |
COG category | [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism |
COG ID | [COG1122] ABC-type cobalt transport system, ATPase component; [COG1126] ABC-type polar amino acid transport system, ATPase component |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0433816 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCGCA TCGACCACGC TACCTGGACC TACCCGCACG CCGAGCAGCC GAGCCTGCGA GACCTCACCC TGCGCGTCAA CCCGGGTGAG TTCGTGATCC TCTGCGGCGC GTCAGGATCC GGCAAATCCA CCGCACTCCG GCTCATGAAC GGCCTCATTC CGCACTTCCA CGAGGACGGC GTCCTCACCG GCACCGTCAC TGTCGGTGGG CTGGTCACGA CCAATGCCGA GCTGGACGCT ATGGGCCTCG TCACCGGCAC CGTGCTGCAG CATCCGAAGC GCCAGTTCTT CACCGACACC GCTCCGGAGG AGGTCGCCTT CGCGATGGAG AACTTCGGCT TCCCCCCGGA AGAGATCCGG CGCCGCGTCG TGGAGACGGT CGAGGAGCTC GCCACGGGGG TGCCGGTCGA ACAACGCCTG CGGGATCTCT CCGGCGGTCA GCAGCAACAG GTCGCGATCG CCGCTGCGAT CGCCCACCGC CCCAGCGTCC TCCTGCTGGA TGAGCCCAGC TCGAACCTCT CGTCCGACGC GGTGCAGCGC CTCACCGCCA CGCTCGCCAG CCTCAAGGCG CAAGGAGTGA CGATCGTGAT CGCCGAACAC CGACTGCGCT ACCTGGAAGA CCTCGTCGAC CGGGTCATCG TGATGCGCGA CGGCGCGATT GACGTCGAGT GGCCCGCGGC GCAACTCCGT GCCGTGCCGG ATGACGAGCT CGCCCGCGAG GGACTCCGCG GGGTCGTGAG CACGGTCGAT CTGCCGGCCC TGCCGGCATC AGGCGCCAGC ATCGTCGCAG GAGCCGACGC ATCGGAGATC CCGGGCGCCG CGCTCGAACT GGAGGCGATC CGCTGCCGCC TCGGCGGGCG CATCGTGCTC GACATCGACC GCGTCGCCTT CGCGGACGGC TCGGTCACCG CGGTCCGCGG AGTCAACGGT GCAGGCAAGT CCACTTTCGC TCGAATCATG ACCGGCCTGC AACGCAGCAC CGGCACCGTC TTCCTCGATG GGAAGGCGCT GAACCCGCGG GCACGCCAGC GCGCGAGCGC GATCGTCATG CAGGACGTCC AGCGGCAGCT TTTCACCGAC AGCGTCAAAG CCGAGATCCA CCTCGCCGGC ACCGACACCC CCGAGGCTCC TGATACGGAT ACGGTGCTCG ATGCCCTCGA CCTCGCGCAC CTCGCCGACC GGCATCCGCT GTCGCTCTCC GGTGGCCAAC AGCAGCGCCT CGTCGTGGCT GCCGTCCGGG TTGCTGGCCG ACGCATCGTC GTGTTCGACG AGCCCAGCTC CGGCGTGGAC CGCCGCCACC TGCGGTCCAT CGCCGACCAG ATCCGCCGCC TCGCCGCCGA CGGCGCCATC GTCCTACTCA TCAGCCATGA CGACGACCTG CTCGCGCTCG CCGCAGACCG GCAACTCACT CTGGCCCCAC CGCTGAGCTC GTCGCGGAAC CGGCATGGCG CTCACGGAGA ACCGACCGTT GAGGAAACCC GATGA
Protein sequence | MIRIDHATWT YPHAEQPSLR DLTLRVNPGE FVILCGASGS GKSTALRLMN GLIPHFHEDG VLTGTVTVGG LVTTNAELDA MGLVTGTVLQ HPKRQFFTDT APEEVAFAME NFGFPPEEIR RRVVETVEEL ATGVPVEQRL RDLSGGQQQQ VAIAAAIAHR PSVLLLDEPS SNLSSDAVQR LTATLASLKA QGVTIVIAEH RLRYLEDLVD RVIVMRDGAI DVEWPAAQLR AVPDDELARE GLRGVVSTVD LPALPASGAS IVAGADASEI PGAALELEAI RCRLGGRIVL DIDRVAFADG SVTAVRGVNG AGKSTFARIM TGLQRSTGTV FLDGKALNPR ARQRASAIVM QDVQRQLFTD SVKAEIHLAG TDTPEAPDTD TVLDALDLAH LADRHPLSLS GGQQQRLVVA AVRVAGRRIV VFDEPSSGVD RRHLRSIADQ IRRLAADGAI VLLISHDDDL LALAADRQLT LAPPLSSSRN RHGAHGEPTV EETR
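The numeric fields in this record can be cross-checked against one another. The following is a minimal Python sketch, not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence ends with a single stop codon that is not counted in the protein length. The function names gene_length, protein_length, and gc_percent are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base groups is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


# Values copied from the record above.
length_bp = gene_length(5216949, 5218433)
print(length_bp)                  # 1485, matching "Gene Length"
print(protein_length(length_bp))  # 494, matching "Protein Length"
```

Passing the full "Gene sequence" field to gc_percent gives a value that can be compared with the reported GC content; the grouping spaces in the field do not need to be stripped first.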