Gene Sros_3801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3801 
Symbol 
ID8667091 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4239794 
End bp4241641 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content70% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003339464 
Protein GI271965268 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGA CCACGACTCA GGGATGGCGC GGCGTCGCCG CCGAGAACAA GGACGACCTG 
TCGGAGCAGG TGTCGTTCCT GCTCCGCGCC CGGTCACGAC GGCTGCTCGG CGAGCTGCTG
CGGCCGTACC GCAGGGAGAT CGCGCTGCTC GTCGCGGTCA TCGTGATCTG CAACGCCGCC
GCTCTCGCCA TCCCCTACCT GATCAAGGTG GGCATCGACG CCGGCATCCC GCCGATGGTC
GCGGGCGAGG GGCCGGCCAC GCTGGTGACG GTCGTCGTGG CGGTCCTGGC GGCGGCGATC
ACCCAGGCGG CCACCCGGCA GGTGTTCCTC AGGATGGCGG GCCGCATCGG CCAGAGCATC
CTGCTGGAGC TACGGCGGCG GGTCTTCGCC CACTTCCAGA GGCTGTCGCT GTCGTTCCAC
GACGACTACA CCTCGGGCCG GGTCGTCGCC CGGCTCACCT CCGACATCGA GGCCATCTCC
GAGATGCTGC AGTCGGGCTT CGACGGCCTG GTCACCGCCG TGCTGACGCT GACCGGCACC
GCGGTCCTGC TGCTCGTCCT CGACGTGCCG CTCGCCGTGG TCGCGCTGCT GCCGCTCCCG
GTCCTGCTGC TGTTCACCCG ATGGTTCCGG CGGCAGTCGA GCATCACCTA CCGCAGGACC
CGGGAGACCG TGGCGCTGGT GATCGTCCAC TTCGTGGAGT CGATGACCGG CATCCGGGCG
GTCCAGGCGT TCCGCAGGGA GCCGCGCAAC CAGGAGATCT TCGCCCAGCT CAACGCCGAC
TACGGGCACG CCAACGTGCA GAGCATGCGC CTCATCGCGC TCTTCATGCC GGGCGTCAAG
CTCATCGGGA ACGTCACGAT CGCCGCCGTC CTGTTCTACG GCGGCCTGCT GGCCATCGAC
GGCGACGTCA CGGTGGGCGT GCTCGCCGCG TTCCTGCTCT ACCTGCGCCA GTTCTACGAG
CCGATGCAGG AGATCAGCCA GTTCTACAAC ACCTTCCAGT CGGCGGGGGC GGCCCTGGAG
AAACTCTCCG GCGTGCTGGA GGAGAGGCCC GCGGTGGCCG AGCCCCGCAC TCCCGTGGCG
CTGGAACGGC CACGCGGGGA GATCCGGTTC GAGGAGGTGG AGTTCTCCTA CCTGGACGGC
ACCCCGGTAC TGTCCCGGAT GGATCTGGCG ATCCCGGCGG GGCAGACCGT GGCGCTGGTC
GGCACCACCG GGGCGGGGAA GACCACGCTG GCGAAACTGG TCTCCCGGTT CTACGACCCC
GTCGCCGGCC GTGTGCTGCT CGACGGGGTC GACCTGCGTG ATCTCGGCGA GGACTCGCTG
CGCGGCGCGG TGGTCATGGT GACCCAGGAG AACTTCCTGT TCACCGGATC GGTCGCCGAC
AACATCAGGT TCGGCCGGCC CGGCTCGACC ATGGCCGAGG TCGTCGAGGC CGCCCGGTCC
ATCGGCGCCC ACGAGTTCAT CTCGGCGCTT CCGGAGGGCT ACGACACCCA GGTCGCCAAA
CACGGCGGCA GGCTGTCGGC CGGGCAGCGG CAGCTCGTGG CGTTCGCCCG GGCCTTCCTC
GCCGACCCCG CGGTGCTGAT CCTCGACGAG GCGACCTCCA GCCTGGACGT CCCCGGCGAA
CGGCTGGTGC AGCGGGCGAT GCGGACGATC CTGGCGGAAC GGACCGCTCT GATCATCGCC
CACCGGCTGT CGACCGTCGA GATCGCCGAC CGGGTGCTCG TGATGGACGG CGGCGGCATC
GTCGAGGACG GTCCTCCCGA CCAGCTCATC GCACGGGCGG GCCGCTTCGC CGGCCTGCAC
CAGGCGTGGC TGGACAGCAT CTCGGATCTG CCGGCCCCAC CCGGTTAG
 
Protein sequence
MSQTTTQGWR GVAAENKDDL SEQVSFLLRA RSRRLLGELL RPYRREIALL VAVIVICNAA 
ALAIPYLIKV GIDAGIPPMV AGEGPATLVT VVVAVLAAAI TQAATRQVFL RMAGRIGQSI
LLELRRRVFA HFQRLSLSFH DDYTSGRVVA RLTSDIEAIS EMLQSGFDGL VTAVLTLTGT
AVLLLVLDVP LAVVALLPLP VLLLFTRWFR RQSSITYRRT RETVALVIVH FVESMTGIRA
VQAFRREPRN QEIFAQLNAD YGHANVQSMR LIALFMPGVK LIGNVTIAAV LFYGGLLAID
GDVTVGVLAA FLLYLRQFYE PMQEISQFYN TFQSAGAALE KLSGVLEERP AVAEPRTPVA
LERPRGEIRF EEVEFSYLDG TPVLSRMDLA IPAGQTVALV GTTGAGKTTL AKLVSRFYDP
VAGRVLLDGV DLRDLGEDSL RGAVVMVTQE NFLFTGSVAD NIRFGRPGST MAEVVEAARS
IGAHEFISAL PEGYDTQVAK HGGRLSAGQR QLVAFARAFL ADPAVLILDE ATSSLDVPGE
RLVQRAMRTI LAERTALIIA HRLSTVEIAD RVLVMDGGGI VEDGPPDQLI ARAGRFAGLH
QAWLDSISDL PAPPG