Gene Sros_6102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_6102 
Symbol 
ID8669400 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp6692464 
End bp6693729 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content71% 
IMG OID 
ProductHistidine--tRNA ligase 
Protein accessionYP_003341576 
Protein GI271967380 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.998614 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTTGC AGGCGCCGAA GGGCACCTTT GACTGGCTCC CGCCGCGTTC GGAGCAGGCG 
CTCGCCGTGC GGGAGGCGCT GACCGCCCCG GTCCGCCGCG CGGGGTACGG CTACATCGAG
ACGCCGGTCT TCGAGGACAC CGCGCTGTTC GTCCGCGGTG TCGGCGAGTC GACCGACATC
GTCTCCAAGG AGATGTACAC CTTCGAGGAC AAGGGCGGGC GCTCGCTCAC GCTGCGCCCC
GAGGGCACCG CGTCGGTCGT GCGCTCGGTC CTCCAGCACG GCCTGCACAA CGGCCAGCTC
CCGGTGAAGC TCTGGTACTC CGGCAGCCAG TTCCGCTACG AGCGCGCGCA GAAGGGCCGC
TACCGCCACT TCTGGCAGAT CGGCGCCGAG GCCCTGGGAG CCGAGGACCC CGCGCTGGAC
GCCGAGCTGA TCGTGCTGGC CGCCGACGGC TACGCCGGGC TGGGCCTCAC CGGCGTGCGG
CTGCTGCTCA ACACGCTGGG CGACAAGGAG TGCCGTCCCG GCTACCGGAC GGCGCTGCAG
GACTTCCTGC GCGCCCTCGA CCTCGACGAG CCCACCCGGC AGCGGATCGA GATCAATCCG
TTGCGCGTCC TCGACGACAA GCGCCCCGAG GTGCAGGCCC AGCTCGCCGG CGCCCCGCTG
GTCGTCGACC ACCTGTGCGA GGCCTGCAAG GCCTACCACG AGGAGGTCCG CTCGCTGCTG
ACCGCCGCCG GCGTGGCCTA CACCGACGAC CCCCGGCTGG TCCGCGGTCT CGACTACTAC
ACGCGCACCA CCTTCGAGTT CGTCCACGAC GGGCTGGGCT CGCAGTCGGC GGTCGGCGGC
GGCGGCCGCT ACGACGGGCT GAGCGAGATG CTCGGCGGCC CCGCCCTGCC CAGCGTCGGC
TGGGCGCTCG GCGTCGACCG GACGCTCCTG GCAATGGAGG CCGAGGGGCT GGCCGGTGCC
GAGACCGCCG AGTCGCGTGT CCAGGTGTAC GGTGTGCCGC TGGGTGAGGA GGCGCGCCGC
CGGATGTTCC TGCTCATGAC CGAGCTGCGC CGGGCCGGTC TCGACGCCGA CATGTCGTTC
GGCGGCAAGG GCGTCAAGGG TGCCATGAAG GGCGCCGACC GGTCGGGTGC GAGCTATGCC
GTGATCCTCG GCGAGCGAGA TATCGCCGCC GGGTCCGCGC AGGTCAAGGA CCTGGCCAGC
GGTGACCAGA CCGCCGTACC GCTCGCTGAG ATCGTCACGA CCTTGAAGGA GAGACTGAAG
AAATGA
 
Protein sequence
MTLQAPKGTF DWLPPRSEQA LAVREALTAP VRRAGYGYIE TPVFEDTALF VRGVGESTDI 
VSKEMYTFED KGGRSLTLRP EGTASVVRSV LQHGLHNGQL PVKLWYSGSQ FRYERAQKGR
YRHFWQIGAE ALGAEDPALD AELIVLAADG YAGLGLTGVR LLLNTLGDKE CRPGYRTALQ
DFLRALDLDE PTRQRIEINP LRVLDDKRPE VQAQLAGAPL VVDHLCEACK AYHEEVRSLL
TAAGVAYTDD PRLVRGLDYY TRTTFEFVHD GLGSQSAVGG GGRYDGLSEM LGGPALPSVG
WALGVDRTLL AMEAEGLAGA ETAESRVQVY GVPLGEEARR RMFLLMTELR RAGLDADMSF
GGKGVKGAMK GADRSGASYA VILGERDIAA GSAQVKDLAS GDQTAVPLAE IVTTLKERLK
K