Gene Svir_33020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSvir_33020 
Symbol 
ID8388626 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharomonospora viridis DSM 43017 
KingdomBacteria 
Replicon accessionNC_013159 
Strand
Start bp3582677 
End bp3584017 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content66% 
IMG OID644977326 
Productarabinose efflux permease family protein 
Protein accessionYP_003135095 
Protein GI257057263 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.471979 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTCCA CTTCGATATC GCCGGACAAG GCGAAGAAGG CGAGAACCGT CGCGGTCGCC 
AGCTACATCG GCACGACCAT CGAGTGGTAC GACTTCTTCA TCTACGGCGT CGCCGCGACG
CTGGTCTTCC GCACCCAGTT CTTCCCGGAG TTCTCCGAAC TCGGGGGTAC GCTGGCCGCG
CTTGCGACCT TCGCGGTCGG TTTCATCGCA CGTCCCATCG GCGGGATCGT GATGGGGCAC
TTCGGTGACC GGGTCGGTCG TAAGTCGATG CTCGTCACCT CGCTGCTGAT CATGGGCATC
GCGACGACCC TCATCGGCGT GCTCCCGACG TACGACCAGA TCGGTGTCTG GGCGCCCATC
CTGCTCGTCG TGCTGCGGCT CGCCCAGGGC GCCGGTGTCG GGGGTGAATG GGCGGGTGCG
GTGCTGATGG CCGTCGAGCA CGCACCGCCG AAGCGACGGT CGTTGTACGG CTGCTTCCCG
CAGTTGGGTC TGCCCTCGGG AATCCTGCTG TCCCAGTTGG TCTTCCTCGT GCTCACGGGC
GTGTTGCCGG AGGCTGCGTT CAGCGCCTGG GGTTGGCGTA TCCCCTTCCT GATCAGCGCG
GCGTTGATCC TGGTCGGACT GCTCATCCGG CTGCGGATCG AGGAGAGCGC GGACTTCGAA
CGCGTCCGCG CGGCGGGGCA GGTCGAGAAG CTGCCGGTGG TGCAGGTCTT CCGGCGTACC
CCGCTGCAGG TGCTCGTGGG TAGCGTGGCT TCGATCGCCG CTCCGACCCT GGGCTACCTG
GTGTCGGTCT ACATGGTGTC CTACGGCACC AACACCCTCG AGCTGCCCAC GACGACCATG
CTGTGGACGC TCGTCGGCGT GAGCGTGTTG TGGAACGGCA TCATGTTGGC GGCCGGTCTG
GCCGGTGACG TGCTGGGCCG CAAGCCGACG TTCCTCATCG GGGCCGCGCT GTCGGTGGTG
TGGGCCTTCC CGATGTTCTG GCTCGTGGAC ACCGGGTCCC TCTTCTGGAT CTTCGTGGCC
CTGGTCGTCA TCACCGCGGC CAACTCCATC ATGGCGGGAC CGCAGCCGGC ACTGGTCACG
GAGATGTTCC CCGTTCGGCT GCGCTACAGC GGGTCGTCGA TCTGCTACCA GATCGGATCG
ATCATCGGCG GCGGTGTGGC CCCGATCCTG GCGACGACAC TGTTCGCCAA GTTCGGCAAC
CCCGCGGTCT CGACGCTGAT CGTCGTCATC TCGCTGCTCA GTCTGTTCGC GATCCTGTTC
GCGGGCCAGC GGATCCTGCA AGCACAGGAA CCGACGGCCC CGCAGGCCCA GCAGAGCCGA
CTGCAACCGC TCGCCGAGTG A
 
Protein sequence
MSSTSISPDK AKKARTVAVA SYIGTTIEWY DFFIYGVAAT LVFRTQFFPE FSELGGTLAA 
LATFAVGFIA RPIGGIVMGH FGDRVGRKSM LVTSLLIMGI ATTLIGVLPT YDQIGVWAPI
LLVVLRLAQG AGVGGEWAGA VLMAVEHAPP KRRSLYGCFP QLGLPSGILL SQLVFLVLTG
VLPEAAFSAW GWRIPFLISA ALILVGLLIR LRIEESADFE RVRAAGQVEK LPVVQVFRRT
PLQVLVGSVA SIAAPTLGYL VSVYMVSYGT NTLELPTTTM LWTLVGVSVL WNGIMLAAGL
AGDVLGRKPT FLIGAALSVV WAFPMFWLVD TGSLFWIFVA LVVITAANSI MAGPQPALVT
EMFPVRLRYS GSSICYQIGS IIGGGVAPIL ATTLFAKFGN PAVSTLIVVI SLLSLFAILF
AGQRILQAQE PTAPQAQQSR LQPLAE