Gene Sros_3350 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3350 
Symbol 
ID8666638 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3676375 
End bp3677661 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content71% 
IMG OID 
ProductABC-type sugar transport systems ATPase components-like protein 
Protein accessionYP_003339032 
Protein GI271964836 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.00388587 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGTACCG TCGTCCTCGA CAATGTGAGC AAGGTCTACC CCGGCGGATA CCTGGCGGTC 
GACCGCATGA ATCTGCGCGC CGAAAACGGG GAGTTCCTCG TCCTGCTGGG GCCTTCCGGC
TGCGGCAAGT CCACACTGCT CCGGATGATC GCGGGCCTGG AGGAGGTGAC CTCCGGAGAC
CTCTGGCTGG GCGGCACCCT CGCCAACGAC CTCGCCCCGC GTGACCGGGA CGTCGCCATG
GTCTTCCAGA ACGGCGCGCT CTACCCTCAC CGCACGGTAC GCGGCAACAT GGCCTTCCCC
CTGGAGATCG CCAAGGCGGA CCCGGCGATG GTCAGGGAAC GCGTGACGGA GCTGTCCAAG
GCGCTGCACA TCGACGAGAC CCTCGAACGC CGCCCTGGAA CGCTCTCAGG CGGCCAGCGC
CAGCGCGTCG CGATGGGCAG GGCGATCGTC CGCCAGCCCT CGCTCTTCCT GATGGACGAG
CCGCTGTCCA ACCTCGACGC GGGCATGCGC ACCGAGCTCC GCATGGAGAT CTCCTCCCTG
GTCCGCTCCC TCGGCGTGAC CACGGTCTAC GTGACGCACG ACCAGGTCGA GGCCCTGACC
CTGGCCGACC GGATCGCCAT CATGAACCGC GGCGTGCTCC AGGACGTCGG CACCCCCGGT
CAGGTCTACA ACGACCCGGC CACCGCCTTC ACCGCCGCCT TCCTCAGCTC CCAGCAGCTC
AACCTGCTCG CGGCCACCGT GCGCACCCCG CAGAACCAGT TCATCCTGCT GGACTTCGGG
GTGCACCAGA TCATGATCCC TTGGACCGAT CCCAGGGCCT ACGCCATCTC CCAGCACGTC
GGCCACCAGA TCATCGTGGG CCTGCGCCCG GACTGCCTGG CGCCGGTCCC CGAGACGTTC
GAGGGCCCCG TCTTCCTCGG CCGCGTCCGA GCCCTGGAGT ACCACGGCCA CGAGTGGCTC
GCCTACGTCG AGAGCGGCAT CCCCACCGTG CCCGTCCCCG AGCCCCCGGA CCCCCGCCAC
AAGGTCCGCG ACCTGGCGGC CACCGCCCCG GGAGGCCGGG CCCGCGCGGT CCTCAGGCGC
CTGCTGCCGG GCTCCGGGCT CGACCCCGAA CCTGCGCAGG CGCAGGAGCA GGTCGGCGCC
GGCACCCACC GCCGGGCCGA CCTGATCGTC CGCGTCGGCT CCCGCCCCGT CTGGCGGGCC
GGCGAACCGG CCCGCGTGGG CGTCGACGTC ACCCGCCTGA TGCTCTTCGC CCTCGACGGC
TCCCGCATCG ACCCGCCGCA CCGCTGA
 
Protein sequence
MSTVVLDNVS KVYPGGYLAV DRMNLRAENG EFLVLLGPSG CGKSTLLRMI AGLEEVTSGD 
LWLGGTLAND LAPRDRDVAM VFQNGALYPH RTVRGNMAFP LEIAKADPAM VRERVTELSK
ALHIDETLER RPGTLSGGQR QRVAMGRAIV RQPSLFLMDE PLSNLDAGMR TELRMEISSL
VRSLGVTTVY VTHDQVEALT LADRIAIMNR GVLQDVGTPG QVYNDPATAF TAAFLSSQQL
NLLAATVRTP QNQFILLDFG VHQIMIPWTD PRAYAISQHV GHQIIVGLRP DCLAPVPETF
EGPVFLGRVR ALEYHGHEWL AYVESGIPTV PVPEPPDPRH KVRDLAATAP GGRARAVLRR
LLPGSGLDPE PAQAQEQVGA GTHRRADLIV RVGSRPVWRA GEPARVGVDV TRLMLFALDG
SRIDPPHR