Gene Sros_8009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_8009 
Symbol 
ID8671334 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp8821710 
End bp8822930 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content75% 
IMG OID 
ProductCystathionine gamma-lyase 
Protein accessionYP_003343407 
Protein GI271969211 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACCGCC GCGCCCGCAG ACGGCAGGGC GAGAACACCC GATCGGTGCA CCTGCCCGCC 
CCGCAGGCGC CGCAGCAGGC CCCCCTCGGC CTCCCGGTGT GGCGCAGCTC CGCGTGGAGC
TCCGCCGACT CCGCCCGGCA CGCCGAGGCC TTCGACGACA AGAGCTCCGC CCACGGCCGG
ATCGAGAACC CCACCACCGC CGCCTTCGCC GCGGCGGTGG CCTCGCTGGA GGGCGCCGCG
GCGCCCGAGG ACGTCGCCGG GCAGGCGTTC GCGTCGGGGA TGGCCGCGGT CAGCGCGGTC
CTGATGACGT TCCTGCGCGC GGGCGCGCAC GTCGTGGCCC CGATCCCGCT GTACGGCGCG
ACCCGCTCGC TGATCACCAA CGTGCTGTCC CGGTTCGGGG TCGAGGCCGG GTTCGTGAAC
TACGGCGACC TCGCCCGGGT GCAGGCGGCG ATCCGGCCCG CCACGAAGAT CATCTATGCC
GAGACGCTGT CCAATCCCAC GACGGCCGTC TCCGACATCC GCGGCCTCTA CCGGATCGCC
CGCCGGGCCG GGGCGCTGCT GGTGGTCGAC TCCACCTTCG CCACCCCGGT CGTGTGCCGT
CCCCTCGAAC ACGGCGCCGA CATGGTGATC CACTCCGCGG CCCGCTACCT CGGCGGCCAC
GACGACTGCG CGGGCGGTGT CGTGGTCGGC CGGCCCGACC TGATCGCCCG GCTCCGGGAG
GTCAGCGCCG ACATCGGGGG CACGCTCTCC CCGGACGACG CGTTCCTGCT CCGCCGGGGG
CTGGAGACCC TGCCGCTGCG GGTGCGCCGG ATGTGCGCCA CCGCGATGGT GTTCGCCGCG
GCCGTGGCCA AGCATCCGGC CGTCCGCCGG GTCGACTACC CCGGCCTGCC CGGTCACCCC
GGCCACCAGC TCGCCCGGCA CCTGTTCGAC TCCGGCCCCG AGGGCACCAG GTACGGCGCC
TGCGTGACGA TCACCCCGCG CGGCGGCTAC GGGGCGGGCA TGGCGCTGAC CGACGCCCTC
AAGCTCGCCT CGATCGCCAC CTCCGCCGGC GGCACCCGCA CCAAGGCCAC GCACCTGGCC
TCGGCCGGTC ACCGCCGGTC CGGTGACGCC TCCGGGATCG ACGCGGCGGC CGTCCGCTTC
TCCATCGGCC TGGAGGACGC CGAGGATCTC ATCGTGGACG TGACCCAGGC GCTCGACTCT
CTCCCCGAAT CGGCCGGATA G
 
Protein sequence
MNRRARRRQG ENTRSVHLPA PQAPQQAPLG LPVWRSSAWS SADSARHAEA FDDKSSAHGR 
IENPTTAAFA AAVASLEGAA APEDVAGQAF ASGMAAVSAV LMTFLRAGAH VVAPIPLYGA
TRSLITNVLS RFGVEAGFVN YGDLARVQAA IRPATKIIYA ETLSNPTTAV SDIRGLYRIA
RRAGALLVVD STFATPVVCR PLEHGADMVI HSAARYLGGH DDCAGGVVVG RPDLIARLRE
VSADIGGTLS PDDAFLLRRG LETLPLRVRR MCATAMVFAA AVAKHPAVRR VDYPGLPGHP
GHQLARHLFD SGPEGTRYGA CVTITPRGGY GAGMALTDAL KLASIATSAG GTRTKATHLA
SAGHRRSGDA SGIDAAAVRF SIGLEDAEDL IVDVTQALDS LPESAG