Gene Sros_5019 details

Gene Information

Locus tag: Sros_5019
Symbol:
ID: 8668313
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 5546043
End bp: 5547410
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: TPR repeat-containing protein
Protein accession: YP_003340557
Protein GI: 271966361
COG category:
COG ID:
TIGRFAM ID:
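
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
# Values copied from the record above.
START_BP = 5546043
END_BP = 5547410


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count, assuming the final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1368 455"
```

Under these assumptions the result agrees with the Gene Length (1368 bp) and Protein Length (455 aa) fields.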


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.4815
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.111309
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCACACC CGCACGAGGG GGTAGCGCGA GCGCTGGCCG AGGCGGAGCA GCAGGTGCGC 
CTCTTCCGGC GGCTCGTCGA GGCCGATCCC GACCGCCACT CTCCCGATCT GGCGAGCGCG
TTGGACGCCC ATGCCCTCCT CCTCGGCCTG TCGGCTCGGC GGGAGGACGC GGTGCCGGTC
GCCCAGGAGG CCGTCGACCT CTACCGTGAC CTGACCGGAA GGCAACCCGA CGAATACGCC
TCGGCCTTCG CCTCGTCGCT GATCAACCTC GGCAACCAGC TGGCCGAACT CGGCCATCAC
CACCACGCCC TCGTTCCCAT GCGAGAAGCC GTCGAGCTGT GTCGCCACCT CACCTCTGCC
GGCCCGGAAC TGGTCTCGTC ACTGGAGTTC ATCGGCATCA AGCTGGGCGA GATGGGCCGC
CATGACGAGG CGCTCGACGC CCTCAACGAA GCGATGGAGT TGCGCCGGTC GCTCATCGAC
ACGGGGAGGA CGGACCACGC CATCGCCCAC TGGACCGGGC TGGTCGCCCT CATCCACCAG
TTGCACGAAA TGGGCCGCAC CGGCGAGGCG CAGCCGTACC AGCGGATGGC CGCCCGCTAC
CACCGCCACA TAGCGGCGAC GCAGCCGGAC TTCATCGCAT TCCTCGACGA GCTGCTGAAC
GGGCACGGCT ACTCCGTGAC CGCGACCGGC TTCCGCCGCA ACCCCGGTCA CGGAACGGCG
AGCACGCCGG ACCGGGACTT GCTCGCCCGC CTGCACGAGC AGAACGAGGA GGGCATGCGG
CTGATGAGGG CGGGACGACT CGACGACGCC CTCGTTCTCT TCGAGCAAGT GATCGACGTC
CTGGACGGAG AATCGGCCGC CGCCGCGACG GCATCTCCGC TGTACAACCT CGCTCTCGTG
CTGACGAAGC TCGGCCGATA CGGTGAAGCC CTCGGCCCTC TCGACCGTGC CGTGCACATC
CACCGCGCCC TGGCGACGTC CGACGCCAGG CGTCTGCCAC AGCTCGCCTC GTCCCTGAAC
AACCTCGGCT GGCTCCTCCT GGCGCTCAGC CGGTTCGAGG ACGCCCTGGT CCCCCTGAAC
GAGGCCGTCG CTCTCTATCG CCGGCACGTC GAATCGCCCG ACGGGCACGC ACGCAGCCTC
TACAACCTCG GCGTCGGGTT GGGGCACCTC CGGCGCCACC GCGAGGCCGT GACCGCGGCC
GAGGGAGCCG TCGACCTGCT GCGTCCACTG GCCGCCTCCG ACCCCGGCGC TCATCGCGCC
AGCCTCGCGG ACGCGCTGAC CTGGCTCGGC AAGCAGCTCA GGCACCTCGG TCGCAATCGC
GAGGCCCGCG CCGCAGAGCG AGAGGCCAGG CGCCTCCAGG TGTACTGA
 
Protein sequence
MSHPHEGVAR ALAEAEQQVR LFRRLVEADP DRHSPDLASA LDAHALLLGL SARREDAVPV 
AQEAVDLYRD LTGRQPDEYA SAFASSLINL GNQLAELGHH HHALVPMREA VELCRHLTSA
GPELVSSLEF IGIKLGEMGR HDEALDALNE AMELRRSLID TGRTDHAIAH WTGLVALIHQ
LHEMGRTGEA QPYQRMAARY HRHIAATQPD FIAFLDELLN GHGYSVTATG FRRNPGHGTA
STPDRDLLAR LHEQNEEGMR LMRAGRLDDA LVLFEQVIDV LDGESAAAAT ASPLYNLALV
LTKLGRYGEA LGPLDRAVHI HRALATSDAR RLPQLASSLN NLGWLLLALS RFEDALVPLN
EAVALYRRHV ESPDGHARSL YNLGVGLGHL RRHREAVTAA EGAVDLLRPL AASDPGAHRA
SLADALTWLG KQLRHLGRNR EARAAEREAR RLQVY
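
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, where GTG is one of the alternative start codons and is read as methionine at the initiation position. The following is a minimal sketch of such a translation, not the pipeline that produced this record. The `translate` function and its handling of whitespace are assumptions made for illustration. The amino acid mapping is the standard one, which table 11 shares; only the set of start codons differs.

```python
from itertools import product

# Standard codon-to-amino-acid mapping in NCBI base order (T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    A table 11 start codon in the first position is read as M.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 60 bases of the gene sequence above.
print(translate("GTGTCACACC CGCACGAGGG GGTAGCGCGA GCGCTGGCCG AGGCGGAGCA GCAGGTGCGC"))
# prints "MSHPHEGVARALAEAEQQVR"
```

The output matches the first 20 residues of the protein sequence. An internal GTG, such as the one near the end of the gene, is still read as valine (V).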