Gene Information |
Locus tag | Sare_4178 |
Symbol | |
ID | 5703966 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4746104 |
End bp | 4747072 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641273605 |
Product | binding-protein-dependent transport systems inner membrane component |
Protein accession | YP_001538958 |
Protein GI | 159039705 |
COG category | [E] Amino acid transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.848155 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0018252 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTGACT TCGAGACGGT GGCGGCATCG GAGGACCCCA ACGCCCGGCG GGGGCCCTCC GGCGAGCCGA ACGCACCGAA CCAGGTCGGC GGCACCTCCG TGAGCGCCGG CCGACCCCGA AGCCTCGCCG GCGACGCCTG GCGTGACCTC CGGCGCAAGC CGATCTTCTG GATCAGCCTC ACCCTGGTCC TGGTGGTCAC CGCGATGGCC GCCGTACCGG CGCTGTTCAC CAACGCCGAC CCGGCCGACT GCATACTCGC CCGGCAGCAC GCGGCACCGT CCGGCGGGGC CATCTTCGGG TACGACTTCC AGGGCTGCGA CACCTACGCC CGGGCGGTCT ACGGAGCCCG AGCCTCGCTG CTGGTCGGGG CCCTTTCGGC GCTGCTCACC GGACTTGTCG CGCTCACCGT CGGCATGCTC GCCGGATACT TCGGCCGGTG GGTCGACGCG GTCCTCTCCC GGGTGATCGA CATCGTGCTC GGCATCCCCC TGCTACTCGC CGCGATCGTG CTGCTGAAGC GGGTCGCCAG CGACAGCGCG ACGGTCCGGA TCGCGGCGGT GATCTTCGTC CTGGCGGTCC TCGGCTGGAC GACGACGGCC CGGGTGGTCC GCTCGTCGGT CATCACGGCA AAGGAGCAGG ACTACGTCGC CGCCGCCCGG ATGCTCGGTG CCGGCAACGG CCGGATCATG TGGCGACACA TCCTGCCCAA CACGCTCGCC CCCGCCATCG TGGTACTGAC CATCGCTCTC GGGTCGTTCA TCGCGGCCGA GGCGACGCTC TCCTTCCTCG GCATCGGCCT CAAGGAACCC ACGATCTCCT GGGGCATCGA CATCAACAGT GGTCGGGTAC ACATGCGCGA ATCGGCCACC CCACTGGTCG TCCCGTCGAC CTTCCTCGCG CTGACCGTGC TGGCGTTCAT CATGCTCGGC GACGCGATCC GGGACGCCTT CGACCCGAAA CTCCGGTGA
|
Protein sequence | MSDFETVAAS EDPNARRGPS GEPNAPNQVG GTSVSAGRPR SLAGDAWRDL RRKPIFWISL TLVLVVTAMA AVPALFTNAD PADCILARQH AAPSGGAIFG YDFQGCDTYA RAVYGARASL LVGALSALLT GLVALTVGML AGYFGRWVDA VLSRVIDIVL GIPLLLAAIV LLKRVASDSA TVRIAAVIFV LAVLGWTTTA RVVRSSVITA KEQDYVAAAR MLGAGNGRIM WRHILPNTLA PAIVVLTIAL GSFIAAEATL SFLGIGLKEP TISWGIDINS GRVHMRESAT PLVVPSTFLA LTVLAFIMLG DAIRDAFDPK LR
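The coordinate and length fields above are mutually consistent, and the relationship can be checked with a minimal sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the unspliced CDS is a stop codon (the gene sequence ends in TGA). The function names `gene_length`, `protein_length`, and `strip_blocks` are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def strip_blocks(seq: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(seq.split())


# Values taken from the record: Start bp 4746104, End bp 4747072.
length = gene_length(4746104, 4747072)
print(length, protein_length(length))  # prints "969 322"
```

The printed values match the Gene Length (969 bp) and Protein Length (322 aa) fields of the record.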