Gene Sros_1729 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1729 
Symbol 
ID8665006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1844544 
End bp1846400 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content70% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003337463 
Protein GI271963267 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.50707 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGCCA ACAATCCCCC CGACCAGTCG CCTACGCCAG GCCAGCAGCC TGACAGGACG 
ATCGCCTACC GTTGGAACGA GGGTGCGCAG CAGAACAGTC AACCGCACGC CCAGGGCCAC
CCCCAGCAGG GTGGTTACCC GCAGCAGGGA CAGCCCGGCT ACCCGCAGCA GCCCGGATAC
CCACAGCAGA ACTACGGCCA GCAGGGCCAG CCCGGCTACC AGCAGCAGCC GCCGAACTAC
GGCCAGCAGC AGGGCCAGCA GGGCTACCCC GGCTACCAGC AGCAGGGCCA GCAGGGTTAC
CAGCAGGCCC AGCCCGGCTA CCAGCAGCAG CAGGGCTACC AGCAGCAGAA CTACGGCCAG
CAGCCCGGCT GGCAGCAGCA GGGCCCCGAT TTCCTCGGCA CGGGACAGCC GACCCCGCCC
GCCCGCAAGG GCGGCAAGGG CTGGCTGATC GCGGTGATCG CCGCCCTGGT CGTCGTCCTC
GTGGGCGGCG GCGGCGCCTT CGCGGTCAAC CTGCTCAGCG GCGGTGGCAC CCAGCCGCAC
GACGTGCTGC CCGGCAACGC CATCGGCTAC GCGCGCCTCG ACTTCGACCC GGCGGCCAAC
CAGAAGCTGG CGCTGTTCAG CATCGCCCGG AAGTTCACCG TCACCAAGGA CTCCTTCACC
GGCGACGACC CGCGCAAGGC CTTCTTCGAC CAGGCCAAGA AGAGCGGCTT CGACAAGGTG
GACTACGCCG CCGATGTCCA GCCGTGGCTC GGCGACCGCA TCGGCATGGC CGCGCTCACC
CCGGCCAAGC GCGGTGCCGA GCCCGGCTTC GTGGTCGCCG TCCAGGTGAC CGACGAGGCC
AAGGCGAAGG CGGGAATCGC CAAGCTGATG GACGGGGAGA AGTACGGCAT CGCGTTCCGC
GAGGACTACG CGCTGCTCAC CGCCACCCAG GCGGAGGCCG ACCAGGCCGC CAAGGCGGCG
CCCCTGTCCG ACAACGCCAA CTTCTCCGAC GACCTGAGCG CCCTGGGTGA GACCGGCGTG
CTCTCCTTCT GGATGGACGC GGGCAAGCTC GCGGACCTCG CCTCCGAGAT CGCCCCCCAG
GACCCCGCCA CCCTCGCGCA GATCAAGAAC GTCCGCGTGG CCGGCGCGCT CCGCTTCGAC
GGCCAGTACG TCGAACTGGC CGGCATCAGC CGCGGGGCGA AGGCCCTGGA GGGCATGGGC
GAGCCCGAGC CCTCCAGGAT CGGCCAGCTC CCGGTCTCCA CCGCCGGCGC GATCTCGATC
TCCGGTCTCG GCGACGTGAT CGGCAAGCAG TGGGCCCAGA TCATGAAGTC GGCCGACCAG
GCCGGCGGCG GCGGGAGCTT CCAGCAGTTC GCCGACCAGG CCCAGCAGAA GTACGGGCTG
GCGCTCCCCG CCGACCTGGC GACGATGCTC GGCAAGAACC TCACCCTGGC GGTGGACGCC
AACGGCCTCG ACGGCGACCA GCCCAAGTTC GGGGCCCGGA TCACCACCGA CCCGGCCAAG
GCGCAGGAGG TCGTCGGCAA GATCGAGAAG TTCCTCGCCG ACTCGGGCAC CGCGGTCCCG
CAGCTCGCCA AGGTCCCCGG TGACGGCACC TTCGTCCTGG CCAGCTCGCA GGAGTACGCC
GCCGAACTCG CCAAGGACGG CAGCCTGGCC GACGACGAGA CGTTCAACCT CGCGATCCCC
GACGCCGGCG CGGCGACCTT CGCCGCCTAC GTCGACCTCA ACAAGGTCGA GAAGTTCTAC
CTGGAGAGCC TGCAGGGTGA CGACAAGGCC AACCTCCAGC AGCTGCGCGC CGTAGGGATC
AGCGGAACGC AGTCCGGTAC GGACGCCTCC TTCTCCCTGC GAGTGCTGTT CGACTGA
 
Protein sequence
MPANNPPDQS PTPGQQPDRT IAYRWNEGAQ QNSQPHAQGH PQQGGYPQQG QPGYPQQPGY 
PQQNYGQQGQ PGYQQQPPNY GQQQGQQGYP GYQQQGQQGY QQAQPGYQQQ QGYQQQNYGQ
QPGWQQQGPD FLGTGQPTPP ARKGGKGWLI AVIAALVVVL VGGGGAFAVN LLSGGGTQPH
DVLPGNAIGY ARLDFDPAAN QKLALFSIAR KFTVTKDSFT GDDPRKAFFD QAKKSGFDKV
DYAADVQPWL GDRIGMAALT PAKRGAEPGF VVAVQVTDEA KAKAGIAKLM DGEKYGIAFR
EDYALLTATQ AEADQAAKAA PLSDNANFSD DLSALGETGV LSFWMDAGKL ADLASEIAPQ
DPATLAQIKN VRVAGALRFD GQYVELAGIS RGAKALEGMG EPEPSRIGQL PVSTAGAISI
SGLGDVIGKQ WAQIMKSADQ AGGGGSFQQF ADQAQQKYGL ALPADLATML GKNLTLAVDA
NGLDGDQPKF GARITTDPAK AQEVVGKIEK FLADSGTAVP QLAKVPGDGT FVLASSQEYA
AELAKDGSLA DDETFNLAIP DAGAATFAAY VDLNKVEKFY LESLQGDDKA NLQQLRAVGI
SGTQSGTDAS FSLRVLFD