Gene Sros_3802 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3802 
Symbol 
ID8667092 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4241638 
End bp4243485 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content71% 
IMG OID 
Productlipid A export ATP-binding protein 
Protein accessionYP_003339465 
Protein GI271965269 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.360172 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.799293 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCCTG ATACCGCCCC CGAGCGAGAT ACCGAGACCG GTCAGTCCCC CACCCCCCGC 
CCGCGCAGCC TCAGGACGAG CTCGCTGTGG CGGATGAAGT CCTACCTCCG TCCCTACACC
ACCCGCCTGC TGCTCATCTG GCTACCGGCC TTCGGCGGGA TCGCCATCGG CATCGTCATC
CCGCTGATCG GCAAGGAGAT CATCGACGGC CCCGTCGCCC GCGGCGACGC CGGAGCCCTC
CTCCCCCTCG CACTGCTCGC CCTGGCCCTG GGCGTGATCG AGGCGCTGCT CATCTTCCTG
CGAAGGTGGT TCCTGGCCGA TGCCGTCATC GGCCTGGAGA CGACGATCCG CGACGACCTC
TACCGGCACC TGCAGCGGCT GCCGATGAGC TTCCACGGCG CCTGGCAGTC CGGCCAGCTG
CTCTCCCGCG CGACGACCGA CCTGTCGGTC ATCCGGCGCT TCCTCGGCTT CGGCATGCTC
TTCCTCGTCC TGATCATCTT CCAGATCGTG ACGGTGACGG GGCTGCTGCT GCAGATGTAC
TGGCCGCTCG GCCTGCTGGT CGCCGCGGCG GCGATCCCGG TCGTGGTCAC CTCGCTGCGC
TTCGAGCGGG GCTACATCAC CGTCTCCCGC CAGGTCCAGG ACGAGCAGGG CGACCTCGCC
ACCGTGGTCG AGGAGTCCGC GGTCGGCATC CGCACGATCA AGGCCTTCGG CCGCGGCCGC
CACGTCTACG ACACCTTCGA CGACGGGGCC CGCAAGGTCT ACCGGACCTC GATGGAGAAG
GTACGGCTGT CGGCCAGGTT CTTCACCTTC CTGGAGGTGA TCCCGAACGT CACCCTCGCC
GTCGTGCTGC TGCTGGGCGC CCTCGCGGTC GGCTCGGGCT CGCTGACGCT GGGCACCCTG
GTGGCCTTCA CCACGCTGAT GCTGCAGCTC GTGTGGCCCG TCTCGGCTCT CGGCTTCATC
CTGGTCATGG CCCAGGAGGC GATGACCTCG GCCGACCGGG TGATGGAGGT GCTCGACACC
GACCCGGAGA TCGCCGGCGG CCTGGACGTG GTCGAGCGCC CCCGCGGCCA CCTGCGCTAC
GAGGGCGTGG AGTTCCGCTT CCCCGGGGCC GCCGAGCCGG TGCTCCGCGA CGTCTGGCTG
GACGTGCGGC CGGGAGAGAC GGTCGCGGTC GTGGGCGCGA CGGGTTCGGG AAAGACCACC
CTGACCTCCC TCGTCCCCCG GCTGTACGAC GTCAGCGCGG GCCGGGTCAC GATCGACGGG
CACGACGTGC GGGACCTGTC GCTGCCCGTC CTGCGCTCGA TGGTCGCCAC GGCCTTCGAG
GAGCCGACCC TGTTCTCCAT GAGCGTCAGG GAGAACCTCA CGCTGGGGCG GCACGACGCC
ACCGACGAGG AGATCGAGGA GGCGCTGCGG GTCGCCCAGG CCGGGTTCGT CCACCAGCTG
CCGTGGGGGC TGGAGACCAG GATCGGCGAG CAGGGCATGT CCCTGTCCGG CGGCCAGCGC
CAGCGCCTCG CCCTGGCCCG TGCGGTCCTC AGCCGGCCGA GGGTCCTCGT GCTGGACGAC
ACCCTGTCGG CCCTGGACGT CGAGACCGAG GCCCTGGTGG AGGAGGCCCT GCGGCATGTC
CTGCGGGACG CCACCGGGAT CGTGGTGGCG CACCGCGCCT CCACCGTGCT GCTCGCCGAC
AAGGTGGCCC TGCTGCTGAA CGGCACGATC GCGCACGTCG GCCGCCACCA GGAGCTGATG
GCGGGCGTGC CGGAGTACCG GGCGCTGCTG TCGGCGGAAC TGGACGGCGA CCCCGACGGG
CTGGACGGGC TGGACGGCGA CCTGGACAGC GAGGGGGCCC TCCGATGA
 
Protein sequence
MPPDTAPERD TETGQSPTPR PRSLRTSSLW RMKSYLRPYT TRLLLIWLPA FGGIAIGIVI 
PLIGKEIIDG PVARGDAGAL LPLALLALAL GVIEALLIFL RRWFLADAVI GLETTIRDDL
YRHLQRLPMS FHGAWQSGQL LSRATTDLSV IRRFLGFGML FLVLIIFQIV TVTGLLLQMY
WPLGLLVAAA AIPVVVTSLR FERGYITVSR QVQDEQGDLA TVVEESAVGI RTIKAFGRGR
HVYDTFDDGA RKVYRTSMEK VRLSARFFTF LEVIPNVTLA VVLLLGALAV GSGSLTLGTL
VAFTTLMLQL VWPVSALGFI LVMAQEAMTS ADRVMEVLDT DPEIAGGLDV VERPRGHLRY
EGVEFRFPGA AEPVLRDVWL DVRPGETVAV VGATGSGKTT LTSLVPRLYD VSAGRVTIDG
HDVRDLSLPV LRSMVATAFE EPTLFSMSVR ENLTLGRHDA TDEEIEEALR VAQAGFVHQL
PWGLETRIGE QGMSLSGGQR QRLALARAVL SRPRVLVLDD TLSALDVETE ALVEEALRHV
LRDATGIVVA HRASTVLLAD KVALLLNGTI AHVGRHQELM AGVPEYRALL SAELDGDPDG
LDGLDGDLDS EGALR