Gene Sros_9021 details


Gene Information

Locus tag: Sros_9021
Symbol:
ID: 8672363
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 9965010
End bp: 9966575
Gene Length: 1566 bp
Protein Length: 521 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: ABC-type sugar transport system periplasmic component-like protein
Protein accession: YP_003344395
Protein GI: 271970199
COG category:
COG ID:
TIGRFAM ID:
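
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state its convention) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 9965010
END_BP = 9966575


def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


gene_len = feature_length(START_BP, END_BP)
print(gene_len, protein_length(gene_len))  # prints "1566 521"
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.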


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCGA TCCGACGCCC GCGTGATCTG CTCCGCCGGT TGGATCCGGC CCAGCGCCGC 
TGGGGCGGCG CCGGGCTGGT GGCCGGGCTG GCCCTGGCGT CGGTGCTCGT CTCCCTCACC
GGGAAGGGCG CCCCGCCCTC CGTCTCCTCC TTCACCGGCT GCGGCGGGCC GAACCGGCTC
ACCGTGGCGA CCGGCGCCGA CATCACCGCC GGGAGCTTCC GCCGCGACCT CATCGAGGAG
TGGAACCGTA CGCACACCAC CCAGGTCACC CTGGTGGAGG TGGCCGACCG CACCGACGAG
GAGCGGGCCG AGATGGTGGG CCGGGCCCGG CTGGGGAGCT GCGCCTACGA CGTGTTCTCC
GTGGACGTCG CCTGGATGGC CGAGTTCGCC CGCAGCGGCT ACATCCGCCC GTACCCTCTC
GAACCTGGAG AGGCCAACCG CTACGTGGAC AACGTTCTGA AAGCCGGCCA GGTGGACGGT
GTCCAGTACG CGGTGCCCTT CGTCACCGAC GTGCCCCTCC TCTTCCACCG TCGTGGCGTA
CCCGTGCCCG CCACGATGGA GCAGCTCTGG AAATACGCCG CCGAGAACGG CGGATACTCC
GTACAGCTCG GCGACTACGA GGGAGGGACG GTCAACCTGC TGGAGGCGAT CCGGTCCGCG
GGCGGGAGGG TCACCGACGG CGAGCGGATC GTGGTGGACG AGGGAGACTC CGCTGAGGCC
GTGCGCGAGG CGCTCTCCCG CTGGCACGCG CTGCTGGAGA AGGGGGTCCT GGCGCGCGGC
GCGGAGAACT TCTCCAAGGA AGACACCTTC AGCACCTTCC GGAACGGATT CCTGGAGCGG
GGGTCGAAGA ACTCCGTGGA GGAGAGCAGC CTCCGGGCGT TCCGCGACGA CGACGTCGCC
TACATGCGCA ACTGGCCGTT CGCGTTCCAC CGCCTGGCCA CGGACCGTTC GATGTACGAC
GACCAGGGAC GGCTGCGCTT CGGGATGGCC GCGCTACCCG GGACAGGCAT GCTCGGCGGG
TTCAATCTGG CGATCTCCGC GCATTCCGGC AACCCGGCCA AGGCCAGGAC GCTCATCGAC
TTCCTGACCG GGCACGACGC GCAGATGAAG CTGTTCGCAT GCAGCGGATA CCCTCCCGTC
CTCGAATCCG TCTACGAGGA GTACGCCAGG AATCCGCGCA CCTGCGGCCA GTTGCTGGCC
GCCTCCGGGA CGGCCGCCGA CCCCACCGCC GGGAAGGGCA CCACGGCCCC GACGGACGGT
CCCACCCCCG TGCCCGCCGG CGAGGACACC GAGCTCACCG GCCCCATGCT CCAGGACCTG
GCGGCGAAGA TCCACCAGGC CCTGCGCACG GCCGAATCGC GTCCGCAATA TTCCTATTAC
GCCACTTTCA GTGAGGTGTT CCGTTCGTGC GCCCGCGCGG TGGTCACCGG CGACCTGCTC
GCGAAGGAGC TGGATCTCGC CCGGTTCGCC GACGCCCTGC GCGACGCGCG GCAGGGCAGA
GCGCCCGCGG AAACGGTCAG GTCGCTCGCC CACTGCGGGA AACCCGAAGG GCGGCAGGGG
CAGTGA
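
The GC content field can be recomputed from the gene sequence above. The following is a minimal sketch, not the method used to produce the record: it strips the spaces and line breaks used in the 10-base layout and counts G and C over all bases. The helper names are hypothetical, and only the first row of the sequence is used for illustration.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks of the 10-base block layout."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence above; paste the full sequence to
# compare against the GC content field.
first_row = "ATGAGCGCGA TCCGACGCCC GCGTGATCTG CTCCGCCGGT TGGATCCGGC CCAGCGCCGC"
print(len(clean_sequence(first_row)), round(gc_percent(first_row), 1))
```

Passing the whole gene sequence to `gc_percent` gives a value to compare with the rounded figure reported in the record.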
 
Protein sequence
MSAIRRPRDL LRRLDPAQRR WGGAGLVAGL ALASVLVSLT GKGAPPSVSS FTGCGGPNRL 
TVATGADITA GSFRRDLIEE WNRTHTTQVT LVEVADRTDE ERAEMVGRAR LGSCAYDVFS
VDVAWMAEFA RSGYIRPYPL EPGEANRYVD NVLKAGQVDG VQYAVPFVTD VPLLFHRRGV
PVPATMEQLW KYAAENGGYS VQLGDYEGGT VNLLEAIRSA GGRVTDGERI VVDEGDSAEA
VREALSRWHA LLEKGVLARG AENFSKEDTF STFRNGFLER GSKNSVEESS LRAFRDDDVA
YMRNWPFAFH RLATDRSMYD DQGRLRFGMA ALPGTGMLGG FNLAISAHSG NPAKARTLID
FLTGHDAQMK LFACSGYPPV LESVYEEYAR NPRTCGQLLA ASGTAADPTA GKGTTAPTDG
PTPVPAGEDT ELTGPMLQDL AAKIHQALRT AESRPQYSYY ATFSEVFRSC ARAVVTGDLL
AKELDLARFA DALRDARQGR APAETVRSLA HCGKPEGRQG Q
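
The protein sequence can be checked against the gene sequence by translating it. The sketch below uses plain Python and no external library. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted), so one 64-entry table is used. The names are illustrative, and only the first row of the gene sequence is translated here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first position varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence above.
first_row = "ATGAGCGCGA TCCGACGCCC GCGTGATCTG CTCCGCCGGT TGGATCCGGC CCAGCGCCGC"
print(translate(first_row))  # prints "MSAIRRPRDLLRRLDPAQRR"
```

The output matches the first 20 residues of the protein sequence. Translating the full gene sequence the same way should give a string to compare with the whole 521 aa entry.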