Gene Sros_7895 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_7895 
Symbol 
ID8671219 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp8705973 
End bp8708279 
Gene Length2307 bp 
Protein Length768 aa 
Translation table11 
GC content70% 
IMG OID 
Productexcinuclease ABC subunit A 
Protein accessionYP_003343295 
Protein GI271969099 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCCGA GGACGGACAC ACAGTCGCCT GCACCACACG TTGCCGACAG CCACGATCTG 
ATCCGTGTGC AGGGCGCGCG CGAGAACAAC CTCAAGGACG TCAGCATCGA GATCCCGAAG
CGCCGGCTGA CGGTGTTCAC CGGCGTCTCC GGCTCGGGCA AGAGCTCGCT GGTGTTCGGC
ACGATCGCCG CGGAGTCGCA GCGGATGATC AACGAGACCT ACAGCACCTT CGTGCAGGGC
TTCATGCCGA CGCTGGCACG GCCGGAGGTC GACCTGCTGG AGGGGCTGAC AACGGCGATC
ATCGTCGACC AGGAGCGGAT GGGCGCCAAC TCCCGCTCCA CCGTCGGCAC CGCCACCGAC
GCCAACGCGA TGCTGCGGAT CGTCTTCAGC CGGCTCGGCG AGCCGAGTGC CGGGCCGTCG
TTCCACTACA GCTTCAACAC GCCCGCCGGC GCCTGCCCGC GCTGCGAGGG CATGGGCAGC
GTGACCGACG TCGACCTGAC CCAGCTCTAC GACGACAGCA AGTCGCTCGC CGAGGGCGCG
CTCACGATTC CCGGCTACAG CATGGACGGC TGGTACGGCC GGATCTTCAG CGGCTGCGGC
TTCTTCGACC CCGACAAGCC GATCCGCAAG TTCACCAAGG CCGAGCTCCA CGACCTGCTC
CACAAGGAGC CGACCAAGAT CAAGGTCGAC GGCATCAACC TGACGTACGA GGGCCTGATC
CCGAAGATCC AGAAGTCGAT GCTGTCCAAG GACCGGGATG CGCTGCAGCC GCACATCCGG
GCGTTCGTGG ACCGGGCCGT CACCTTCACC GCCTGTCCCG AGTGCGACGG CAGCCGGCTC
AACGAGGCCG CCCGCTCCTC GCGGATCAAG GGCGTCAACA TCGCCGACGC CTGCGCGATG
CAGATCAGCG ACCTGGCCGA ATGGGTCCAC GGCCTCGACG AGGTGTCGGT GGCGCCGCTG
CTCGCCACGC TGCGGCAGAC CCTCGACTCG TTCGTGGAGA TCGGGCTGGG CTACCTCTCG
CTCGACCGGC CGTCGGGCAC GCTGTCGGGC GGCGAGGCAC AGCGCACCAA GATGATCCGC
CACCTCGGCT CCTCGCTCAC CGACGTCACC TACGTCTTCG ACGAGCCCAC CACGGGCCTG
CACCCCCATG ACATCCAGCG GATGAACGAC CTGCTGCTGC GGCTGCGGGA CAAGGGCAAC
ACGGTGCTCG TCGTGGAGCA CAAGCCGGAG GCGATCGCGA TCGCCGACCA CGTCGTCGAC
CTCGGCCCCG GCGCCGGTAC GGCGGGCGGC ACCGTCTGCT ACGAGGGCAG CCTCGAAGGA
CTGCGGGGCA GCGGCACCCT CACCGGCCGC CACCTCGACG ACCGGGCCGC CATCAAGGAG
ACGGTGCGCA CCCCCAAGGG CACGCTGGAG GTCCGCGGCG CGACGGCGCA CAACCTGCGC
GACGTCGACG TCGACATCCC GCTCGGGGTG CTCTGCGTCG TCACCGGCGT CGCCGGCTCC
GGCAAGAGCT CGCTCGTGCA CGGGTCGATC CCCGCCGGCG CGGGTGTGGT GTCGATCGGC
CAGGGCGCGA TCCGCGGCTC GCGACGGAGC AACCCGGCGA CGTACACCGG CCTGCTCGAC
CCGATCCGCA AGGCGTTCGC GAAGGCCAAC GGCGTGAAGC CGGCGCTGTT CAGCGCCAAC
TCCGAGGGCG CCTGCCCCAA CTGCAACGGT GCCGGCGTCA TCTACACCGA CCTGGCGATG
ATGGCCGGCG TCGCCACCGT CTGCGAGGAG TGCGACGGGA AGCGGTTCCA GGCATCGGTG
CTGGACCACC ACCTCGGCGG CCGCGACATC AGCGAGGTGC TCGCGATGTC GGTGACCGAG
GCCGAGGAGT TCTTCGGCGC CGGCGAGGCG CGCACGCCGG CAGCGCACGC CATCCTCAAC
CGGCTCGCCG ACGTCGGGCT CGGATACCTC AGCCTCGGCC AGCCGCTCAC CACGCTGTCC
GGCGGCGAGC GGCAGCGGCT CAAGCTGGCC ACCCACATGG CCGAGAAGGG CGGCGTCTAC
GTCCTCGACG AGCCGACCAC CGGCCTCCAC CTCGCCGACG TCGAGCAGCT GCTCGGCCTG
CTCGACCGGC TCGTCGACTC CGGCAAGTCG GTCATCGTCA TCGAGCACCA CCAGGCGGTC
ATGGCGCACG CCGACTGGAT CATCGACCTC GGTCCCGGTG CCGGCCACGA CGGCGGCCGG
ATCGTCTTCG AGGGCACACC CGCCGACCTC GTCGCCGCCC GCTCCACCCT CACCGGCGAG
CACCTCGCGG CCTACGTCGG CACCTGA
 
Protein sequence
MTPRTDTQSP APHVADSHDL IRVQGARENN LKDVSIEIPK RRLTVFTGVS GSGKSSLVFG 
TIAAESQRMI NETYSTFVQG FMPTLARPEV DLLEGLTTAI IVDQERMGAN SRSTVGTATD
ANAMLRIVFS RLGEPSAGPS FHYSFNTPAG ACPRCEGMGS VTDVDLTQLY DDSKSLAEGA
LTIPGYSMDG WYGRIFSGCG FFDPDKPIRK FTKAELHDLL HKEPTKIKVD GINLTYEGLI
PKIQKSMLSK DRDALQPHIR AFVDRAVTFT ACPECDGSRL NEAARSSRIK GVNIADACAM
QISDLAEWVH GLDEVSVAPL LATLRQTLDS FVEIGLGYLS LDRPSGTLSG GEAQRTKMIR
HLGSSLTDVT YVFDEPTTGL HPHDIQRMND LLLRLRDKGN TVLVVEHKPE AIAIADHVVD
LGPGAGTAGG TVCYEGSLEG LRGSGTLTGR HLDDRAAIKE TVRTPKGTLE VRGATAHNLR
DVDVDIPLGV LCVVTGVAGS GKSSLVHGSI PAGAGVVSIG QGAIRGSRRS NPATYTGLLD
PIRKAFAKAN GVKPALFSAN SEGACPNCNG AGVIYTDLAM MAGVATVCEE CDGKRFQASV
LDHHLGGRDI SEVLAMSVTE AEEFFGAGEA RTPAAHAILN RLADVGLGYL SLGQPLTTLS
GGERQRLKLA THMAEKGGVY VLDEPTTGLH LADVEQLLGL LDRLVDSGKS VIVIEHHQAV
MAHADWIIDL GPGAGHDGGR IVFEGTPADL VAARSTLTGE HLAAYVGT