Gene Sros_3978 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3978 
Symbol 
ID8667272 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4431758 
End bp4433017 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content68% 
IMG OID 
ProductArginine deiminase 
Protein accessionYP_003339631 
Protein GI271965435 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.103807 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACTG CAGCCTTCGG CGTGCACTCC GAGGTCGGCC CCCTCCGCAA GGTCATCGTC 
CACCGGCCCG ACATGAGCCT GAAACGGCTG ACCCCCACCA ACAACGACAA GCTGCTCTTC
GACGACATCC TGTGGGTCGA GCACGCCCAG AAGGAGCACG ACAGGTTCGT CACCCTGATG
CGCGAGCGCG GCGTCGAGGT GTTCTACCAC CAGGAACTGC TGGCACAGGC GCTGGAGGCG
ACCCCCCACG CCAAGCGGAA CGCCGTCGAG CAGGCCGTCA CCCATCTGAC CGTCGGCCCC
GCCCTGGTGG ACGCCGTCCG TGAGGAGCTG TCCACCTGGA GCGGCAAGGA CCTGGCCACC
CACCTCATCG GCGGGCTGAC CAAGGAGGAG TTCGACGTCC GCGGGTTCGA CACCCGGTCC
CTGGTCGCCG CCTCGGCGGA CCCGCAGCAG TTCGTGCTCC CGCCGCTGCC CAACTCCCTC
TACCAGCGCG ACCCCGCCGC CTGGCTGTAC GGCGGCGTCT CGCTCAACCC GATGTTCTGG
CACGCGCGCC TGCTGGAGAC CATGAACCAG AGCACGATCT ACCACAACCA CCCGATGTTC
ACCGGCGAGG ACTTCTCCTA CTGGTACCCG CCGAGCGGCG ACGAGGCCGA CTTCGACGAG
GAGGACTTCG GCAAGGCCGC GCTGGAGGGC GGCGACATGA TGCCCATCGG CAACGGGACC
GTGGTCATCG GCATCAGCGA GCGGAGCACC CCGCAGATGA TCGAGCACAT CGCCCTGGCG
ACCTTCGCCG CCGGGGCGGC CGAGCGCGTC ATCGCCGTCA ACGTCCCCAA GCGCCGCTCC
TACATGCACC TGGACACCGT GTTCACCTTC CTGGACGTCG ACAAGGCCTC CGCCTACCTG
CCCTTCCTGG AGACGGCCGT CACCCACTCG CTGCGTCCCG GCGACAGGGA CCGGACTCTG
GACGTCCGCC CCGAGAGGGG CTTCGTCGAC GCCGTCGAGG ACGCGCTGGC CATCTCCCGG
CTCGACATCA TCCCCACCGG CGGGGACGAC AGCCAACAGG CCCGGGAGCA GTGGGACTCC
GGCAACAACT TCTTCGCCCT CGAACCCGGC GTCGTCGTCG GCTACCACAA GAACCAGTTC
ACCAACCGCA AGCTGCGCCA GCACGGCGTC GACGTCATCG AGATCGAGGG CTTCGAACTG
GGCAAGGGCC GGGGCGGCAC CCACTGCATG ACCTGCCCCA TCCTGCGTGA GGGCATCTGA
 
Protein sequence
MTTAAFGVHS EVGPLRKVIV HRPDMSLKRL TPTNNDKLLF DDILWVEHAQ KEHDRFVTLM 
RERGVEVFYH QELLAQALEA TPHAKRNAVE QAVTHLTVGP ALVDAVREEL STWSGKDLAT
HLIGGLTKEE FDVRGFDTRS LVAASADPQQ FVLPPLPNSL YQRDPAAWLY GGVSLNPMFW
HARLLETMNQ STIYHNHPMF TGEDFSYWYP PSGDEADFDE EDFGKAALEG GDMMPIGNGT
VVIGISERST PQMIEHIALA TFAAGAAERV IAVNVPKRRS YMHLDTVFTF LDVDKASAYL
PFLETAVTHS LRPGDRDRTL DVRPERGFVD AVEDALAISR LDIIPTGGDD SQQAREQWDS
GNNFFALEPG VVVGYHKNQF TNRKLRQHGV DVIEIEGFEL GKGRGGTHCM TCPILREGI