Gene Sros_2027 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_2027 
Symbol 
ID8665309 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp2178005 
End bp2179282 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content66% 
IMG OID 
Productmajor facilitator transporter 
Protein accessionYP_003337757 
Protein GI271963561 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.145333 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCCGG ACTTGACCCT CACACCAATG TCACACATTG ATGATAAGGA TATGACACGA 
ACGGATACAT CACCATCCCC TTCCGCCGCT GACGCACGGT TCCCGCTCGC GGGTCTGCTG
GCTCTGGCTG CCACGGGGTT CATCACGCTC CTGACGGAGA CGATGCCCGC TGGGATGCTG
TCGCAGATGA GCCGCGACCT GGGAGTGAGC GAGGCGGCTG CGGGGCAGAG CGTCACGGTC
TTCGCGATCG GGGCGATTCT CGCGGCGGTC CCTCTCACGA GAGCCACGAT CGGCTGGCGA
CGCAAGCACC TGCTGCTGTT CGCGATCTCC GGGTTCGCGG TCGCCAACAC GGTCACGGCC
TTTTCCGACA GTTTCGCGCT CACCCTGGCC GCGCGGTTCC TCGGCGGGAT CGTCGGCGGC
ATGCTCTGGG CGCTCCTTGC CGGTTACGCC CGGAGGATGG TTCCCGCGCA CCAGCGCGGC
AAGGCCATGG CCATCGCGAT GGCGGGCGGG ACCGTTGCAC TGTCCGTCGG GGTCCCCGCA
GCCGCGTTCC TCGCCAAGGC CGTCGAATGG CGATTCGCGT TCGGGATCAT CACGCTGGTC
ACTCTCGCAC TGATCGTATG GGTGATCGCC TCCGTTCCGA ACTTCCCCGG CCAATCCAAG
GGCGCACGAC TGCCGCTGGC CCACACGTTC CGTCTGCCCG GAGTCGCCCC GATCCTCGTC
GTGACGTTGA CGTTCGTGTT CGCACACAAC ATCCTCTTCA CCTACATCGC CCCGTTCCTG
GCCCCGCTCG GCATGGCGGG TCAGGTCGAT TCGGTGCTGC TCACCTTCGG CCTGGTCTCG
CTCGTGAGCA TCTGGCTCAC CGGCGTACTC ATCGACCGGC ACCTGCGCAT GCTCATGATC
CTGGCCTGCG CGCTCCTGGC CACAGCCGCT CTCATCCTGA GCGTCTTCTC TGGCAACCCC
GCCCTCGTGT ACGCCAGTGC AGCCCTGTGG GGACTCGCGT TCGGCGGCGC CTCAACCCTG
CTGCAAACCG CGATCGCTGA CGCCGCAGGC GCAGCGGGCG ACGTCGCACA AGCACTCCTC
ACCACATGCT GGAACATCGC GATCGCCGCA GGCGGGATCA TCGGCGGTAT CACCCTCAAC
GTGCTGGGAC CCCCCTCCTT GAGCTGGATC ACACTCGCGC TCCTGCTCCC GGCGCTCGCT
ATCGTCATCG GCGCACGTCG ACACGGATTC ACCAGCCGCA CATTCGAACG CGAGACCGCC
TCCGAACCGG ACGCGTGA
 
Protein sequence
MQPDLTLTPM SHIDDKDMTR TDTSPSPSAA DARFPLAGLL ALAATGFITL LTETMPAGML 
SQMSRDLGVS EAAAGQSVTV FAIGAILAAV PLTRATIGWR RKHLLLFAIS GFAVANTVTA
FSDSFALTLA ARFLGGIVGG MLWALLAGYA RRMVPAHQRG KAMAIAMAGG TVALSVGVPA
AAFLAKAVEW RFAFGIITLV TLALIVWVIA SVPNFPGQSK GARLPLAHTF RLPGVAPILV
VTLTFVFAHN ILFTYIAPFL APLGMAGQVD SVLLTFGLVS LVSIWLTGVL IDRHLRMLMI
LACALLATAA LILSVFSGNP ALVYASAALW GLAFGGASTL LQTAIADAAG AAGDVAQALL
TTCWNIAIAA GGIIGGITLN VLGPPSLSWI TLALLLPALA IVIGARRHGF TSRTFERETA
SEPDA