Gene Sros_1230 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1230 
Symbol 
ID8664505 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1260500 
End bp1261969 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content70% 
IMG OID 
Productglutamate--cysteine ligase, GCS2 
Protein accessionYP_003336971 
Protein GI271962775 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00550271 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGTCGCG ATGTGCCTGC GATGGTGTTC AGCCGGGAGG ACCGACGGCG ATACCGGGAC 
AAGGTCCGCC GATGTCTTGA CGTCTTCGCG CAGATGTTGC GCGAGGCGAG GTTCGAGTGC
GAACGGCCGA AGGCCGGGCT GGAGATCGAG CTCAACCTCG TGGACGACCG CGGCGACCCC
GCGATGAAGA ACGCCGAGGT GCTGGCGGCG ATCGCCGAGC CGGACTGGGC CACCGAGCTC
GGCCAGTTCA ACGTGGAGAT CAACGTCCTG CCGGAGTCCC TTGAGGGGGA CGGCCCCGTA
CGGCTGGAGA AGGTCGTGCG GGACCGGCTC AACCACGCGG AGAACCGGGC CCACACGGTC
GGCGGGCACC TGGTGATGGT GGGCATCCTG CCCACCCTGC GGGAGAGCGA CGTGCACGAG
GGCACGCTGT CGGCCAACCC GCGCTACAAG CTGCTCAACG AGCAGATCTT CGAGGCCAGG
GGCGAGGACC TGCACCTGGC GATCGACGGC GAGGAGACCC TCGACACCTA CGCCGACAGC
ATCACCCCCG AGGCGGCCTG CACGAGCCTC CAGCTCCACC TCCAGGTCAG CCCCGCGGCT
TTCGCCGCCC ACTGGAACGC GGCCCAGGCC ATCGCGGGCG CCCAGGTGGC GGTGGCGGCC
AACTCCCCGT TCCTGTTCGG CCGCCAGCTC TGGCAGGAGA CCAGGATCCC GCTGTTCGAG
CAGGCCACCG ACACCCGGCC GGTGGAGCTG AAGACCCAGG GCGTGCGGCC CAGGGTGTGG
TTCGGCGAGC GGTGGATCAC CTCGGTCTTC GACCTGTTCG AGGAGAACGC GCGCTACTTC
CCCGCGCTCC TGCCCATATG CGAGGACGCC GACCCGCGTG AGGAGCTGAC CCGCGGCGTC
ACCCCCGCGC TGGACGAGCT GACCCTGCAC AACGGCACCG TCTACCGGTG GAACCGGCCG
GTCTACGCCG TCGTCGACGA CATCCCGCAC CTGCGGGTGG AGAACCGGGT GCTGCCCGCC
GGGCCGTCGG TCGCCGACGT CGCCGCCAAC GCCGCGTTCT ACTACGGCCT CATGCGCGTG
CTTCCCCACG CCGAACGGCC GGTGTGGACC CGCATGTCCT TCGCCGCGGC CGGGGACAAC
CTGCACTCCG CCGCCCGGCA CGGGCTGGAT GCGCGCCTCT ACTGGCCGGG ACTCGGCGAG
GTGGCCGCCG CCGAACTGAT CCTGCGACGG CTGCTGCCGC TCGCCTACGA GGGCCTCGAC
CTGTGGGGGG TGAACCCCGA GCCCAGGGAC CGGCTGCTGG GGATCATCGA GCAGCGGTGC
GTGACAGGCA GGACCGGGGC GACCTGGCAG ATCGACACCG TGAAGGAGCT GGGGAACCTC
GACCGGCGCG AGGCGCTGCG CCGGATGACC CTGCGCTACA TCGAGCACAT GCACACCAAC
GAGCCCGTGC ACACCTGGCC GTCACCTTGA
 
Protein sequence
MGRDVPAMVF SREDRRRYRD KVRRCLDVFA QMLREARFEC ERPKAGLEIE LNLVDDRGDP 
AMKNAEVLAA IAEPDWATEL GQFNVEINVL PESLEGDGPV RLEKVVRDRL NHAENRAHTV
GGHLVMVGIL PTLRESDVHE GTLSANPRYK LLNEQIFEAR GEDLHLAIDG EETLDTYADS
ITPEAACTSL QLHLQVSPAA FAAHWNAAQA IAGAQVAVAA NSPFLFGRQL WQETRIPLFE
QATDTRPVEL KTQGVRPRVW FGERWITSVF DLFEENARYF PALLPICEDA DPREELTRGV
TPALDELTLH NGTVYRWNRP VYAVVDDIPH LRVENRVLPA GPSVADVAAN AAFYYGLMRV
LPHAERPVWT RMSFAAAGDN LHSAARHGLD ARLYWPGLGE VAAAELILRR LLPLAYEGLD
LWGVNPEPRD RLLGIIEQRC VTGRTGATWQ IDTVKELGNL DRREALRRMT LRYIEHMHTN
EPVHTWPSP