Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2001 |
Symbol | |
ID | 5704366 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2298780 |
End bp | 2300084 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641271497 |
Product | major facilitator transporter |
Protein accession | YP_001536868 |
Protein GI | 159037615 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.548346 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00155149 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGCTGTCGC TGCTGCTCGT GGTGTCCAGT GGGCGCATGA CACCCGCGAC CGTGGTGCGG CGATTCCCGT ATTCATCGCC GTACTGGCCT GTGCTGAGCA ATTCGTTGCT GCGTCGGATC CTGCCCGGCC TGACGGTGTC CGCCCTGGGC GATGGCATGG CGGTCGTGGC GGTGAGCTGG TTGGCGTTGC AATTGGCGTC GCCGGGGCAG CGCGGCCTGT GGGTGGCGAT CGCGGTAGCG GCTTACACCG TCCCCAGCGT GGTCGGCACG CTGGTGTTCG GCCGGGTGTT GGGCGGGCGG AACGGTGCGC AGTTGGCCGG GTGGGACGCC ACCCTGCGCG CGGGGACGCT TGCGGCGATT CCGGTCGCCT ACCTCTTTGG GGCGTTGAGC CTCGGGTTGT ACGTGACCTT GTTGGCCGTC TCGTCGCTGT TGCACTCGTG GGGTTCGGCG GGACGCTACA CGCTGATCGC CGAGGTGCTC CCGGTACGCG ACCACCTGGC GGGTAACGCC GTCCTCGGCA TCATCGCCGA GATGGCCACC ATCGGTGGGC CTCCGCTGGC CGGACTCCTG ATCAGCTGGG GCGGAGCAAT CTGGGTGATC GCCATCGACG CAGCCACCTT CGCCGTCCTC GCGCTCACTT ACCGGCTGGC TGTACCCGCC GCCGACAGAC CGGCACCGGC GCAAACTGGC GCTTCCCGCA CCGCCGGCTT CGGCGTCATC CGCCGCAACC GCAGCCTGCT CGGCCTGCTT ACCCTGAGCT TCGGGTTCTT CTTCCTCTTC GGCCCCGTCT ACGTCGCCCT TCCCCTCTAC ATCACAGACG AACTGCACGC CTCGGCGACC CTGCTCGGCA CTTATTACAT GGCATTCGGT GCGGGTGCCC TCGTCGGCGG CTTGACCGTG GGCTACCTAC GCCGCAGGCC GCTGTGGGTC GTCACCATCG GCATCGTCGT GGGCTTCGGT CTCACCATGC TGCCCCTCGG GCTGGGCGCA CCCGTCAGTT TGTCCCTGCT GTCCTTTGCC ATCGGCGGGG CGGTGTGGGC GCCGTACATG CCCACGTCGA TGGCGTTGTT CCAACGCAGT ACCACGGCCG CGAACCGCCC GCAGGTCCTC GCCGCCAACG GCGCCGTCAC CGTGGTGGCG GTACCGGCGG GCACCATGCT CGGCGGCCCG CTCGTGAGTG CCCTCGGCGC CCACGAGACG TTGCTGTTCT GCGCTATCGC CATCATCGCC TTCGGAGTGA TCGCCACCGG CTTGACTGTC CTCCATCGCC TCGCGCCTCC CGTCGGCGAC ACCGAGAGGG AGTGA
|
Protein sequence | MLSLLLVVSS GRMTPATVVR RFPYSSPYWP VLSNSLLRRI LPGLTVSALG DGMAVVAVSW LALQLASPGQ RGLWVAIAVA AYTVPSVVGT LVFGRVLGGR NGAQLAGWDA TLRAGTLAAI PVAYLFGALS LGLYVTLLAV SSLLHSWGSA GRYTLIAEVL PVRDHLAGNA VLGIIAEMAT IGGPPLAGLL ISWGGAIWVI AIDAATFAVL ALTYRLAVPA ADRPAPAQTG ASRTAGFGVI RRNRSLLGLL TLSFGFFFLF GPVYVALPLY ITDELHASAT LLGTYYMAFG AGALVGGLTV GYLRRRPLWV VTIGIVVGFG LTMLPLGLGA PVSLSLLSFA IGGAVWAPYM PTSMALFQRS TTAANRPQVL AANGAVTVVA VPAGTMLGGP LVSALGAHET LLFCAIAIIA FGVIATGLTV LHRLAPPVGD TERE
|
| |