Gene Sros_5386 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5386 
Symbol 
ID8668680 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5903654 
End bp5905288 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content72% 
IMG OID 
ProductABC transporter ATP-binding protein 
Protein accessionYP_003340891 
Protein GI271966695 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.221909 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.37846 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGACG CCCTGATCGT GGTTTCGAAC CTGTCATTCT CCTGGCCCGA CGACACCCCG 
GTGTTCGAGG ACCTCTCATT CACCGTCCCG TCGGGACACA CCGGCCTGGT GGCGCCCAAC
GGTGCCGGCA AGAGCACGCT GCTCAAGCTG ATCGCCGGCG CCCTTCGCCC GGCCGCCGGC
TCGGTCACGG TGGCGGGGGT GCTGGGCTAT CTGCCCCAGA CGCTGCCCCT CGTCGGCGAC
CTGACCGTGG CCGAGGTCCT GGAGATCGCT CCGGTGATCA GGGCGCTGAA CGCCATCGAG
GCCGGAGACG CGAGCGAGGA GCACTTCACG ACGATCGGCA ACGACTGGGA CATCGAGGAG
CGCACCCGTG CCCAGCTGGA CCGGCTCGGC CTCGGGGACG TCTCCCTCAC CCGGCGCCTG
CACACCCTCA GCGGCGGCCA GGTCGTCTCC CTCGGCCTGG CGGCACAGCT GCTGAAGCGG
CCCGACGTCC TGCTGCTCGA CGAACCGACC AACAACCTCG ACCTCGACGC GCGCCACAAG
CTCTACGGCG TGCTCGGGGA CTGGAACGGC TGCCTGCTGC TGGTCAGCCA CGACCGGGCA
CTGCTCGACC GCATGGACCG CATCGCCGAG CTCGACCGGG GCGAGCTCCG GTCCTACGGC
GGGAACTACA CCGAGTACGA GGAGGCCGTG CGCGCCGCGC GGGAGGTCGC CGAGAAGAAC
GTCCGCAACG CCGAGCAGGA GGTCAGGCGG GAGAAGCGGG AGATGCAGCA GGCCCGCGAG
CGGGCCGCGC GCCGGGCCGG CAACGCCGCC CGCAACCTCA AGAGCGCCGG CCTGCCGAAG
ATCTTCGCCG GGACGATGAA GCGGCGTGCC CAGGAATCGG CGGGGAGGTC GAACGAGACG
CACGCCATGC GGGTCAGCGA GGCCAGGGCC AGGCTCGACG AGGCGGGCCG GGCAGTGCGC
GACGAGCAGA AGATATCGCT GGAGCTGCCC GGGACCGGCG TCCCGGCCGG ACGCACGGTC
TTCCTCGGCG AACGGATGCA GGTCCGCTAC GGCGAGCGGG CCCTCTTCTC CGGGGAGGGG
GCGGACCTGG CGATCCGGGG GCCCGAGCGG ATCGCTCTGA CCGGCCCCAA CGGCGCCGGC
AAGTCCACCC TGCTGCGCGT GATCAACGGT GAGCTGGAGC CGGAGGGCGG CGGGACCAGG
CGGGCCGACG GCCGGGTCGC CTACCTGTCC CAACGGCTGG ACCTGCTGGA CCTCGACCGC
ACCGTGGCGG AGAACCTGGC CGCGTTCGCG CCGGGGATGC CGGAGGCGCA GCGGATGAAC
CTGCTCGCAC GCTTCCTGTT CCGGGGCTCC CGCATCCACC TCCCGGTCGG GGTGCTGTCC
GGCGGCGAGC GGCTGCGCGC CACCCTGGCC TGCGTCCTGT GCGCCGAACC GGCGCCCCAG
CTGCTGCTGC TGGACGAACC GACCAACAAC CTCGACCTGG TCAGCGTCGC CCAGCTGGAA
GGCGCCCTGC AGTCATACGA GGGCGCCTTC GTGGTGGTCA GCCACGACGA GCGGTTCCTC
GCCGAGATCG GAGTGGACCG CTGGCTGCGG CTGTCCGAGG GGCGTCTGCT CGAAACGGGC
GCCCCCGGTG CGTGA
 
Protein sequence
MSDALIVVSN LSFSWPDDTP VFEDLSFTVP SGHTGLVAPN GAGKSTLLKL IAGALRPAAG 
SVTVAGVLGY LPQTLPLVGD LTVAEVLEIA PVIRALNAIE AGDASEEHFT TIGNDWDIEE
RTRAQLDRLG LGDVSLTRRL HTLSGGQVVS LGLAAQLLKR PDVLLLDEPT NNLDLDARHK
LYGVLGDWNG CLLLVSHDRA LLDRMDRIAE LDRGELRSYG GNYTEYEEAV RAAREVAEKN
VRNAEQEVRR EKREMQQARE RAARRAGNAA RNLKSAGLPK IFAGTMKRRA QESAGRSNET
HAMRVSEARA RLDEAGRAVR DEQKISLELP GTGVPAGRTV FLGERMQVRY GERALFSGEG
ADLAIRGPER IALTGPNGAG KSTLLRVING ELEPEGGGTR RADGRVAYLS QRLDLLDLDR
TVAENLAAFA PGMPEAQRMN LLARFLFRGS RIHLPVGVLS GGERLRATLA CVLCAEPAPQ
LLLLDEPTNN LDLVSVAQLE GALQSYEGAF VVVSHDERFL AEIGVDRWLR LSEGRLLETG
APGA