Gene Sros_2349 details

Gene Information

Locus tag: Sros_2349
Symbol:
ID: 8665631
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 2534648
End bp: 2535880
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: deoxyguanosinetriphosphate triphosphohydrolase
Protein accession: YP_003338072
Protein GI: 271963876
COG category:
COG ID:
TIGRFAM ID:
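
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this) and that the CDS ends in a stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2534648
end_bp = 2535880

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A complete CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon, which encodes no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1233 0 410
```

Under these assumptions the result agrees with the listed Gene Length (1233 bp) and Protein Length (410 aa).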


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.773617
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACGGTT ATGACGAGCA CGATCAGGCC AGATGGGTGC CGGAACCCCC CGGCAACCCG 
GAGAGAGGCC CGTTCGAGCG GGACCGGGCG CGGGTGCTGC ACAGCGCGGG GCTGCGGCGG
CTCGCGGCCA AGACCCAGGT GGTCGGGCCG GGGGAGACCC TCGGCAGCGG GCAGCACATC
CCGCGCACCC GGCTGACCCA CTCGCTGGAG TGCGCGCAGG TCGGCAGGGA GATGGGACAG
TCCCTCGGCC GCGATCCCGA TCTGATGGAG ACCGCCTGCC TGGCCCACGA CCTCGGGCAC
CCGCCGTTCG GGCACAACGG GGAGACCGCG CTCAACGAGC TCGCTGCCGG GTGCGGCGGG
TTCGAGGGGA ACGCGCAGAG CCTGCGCCTG CTGACCCGGC TGGAGGCCAA GGTCCTCACC
GAGGACGGCC GCAGCGCCGG GCTCAACCTC ACCCGCGCCT CCCTGGACGC CGCGGTCAAA
TATCCCTGGA CCCGCGAGAC GAGTCCCAAA TACTGCGCCT ACGGCGACGA CATGGCCGTC
TTCGAGTGGA TCAGGGAGGG CGCGCCCGAG GGGCGGGTCA GCTTCGAGGC CCAGATCATG
GACTGGGCCG ACGACGTCGC CTACTCGGTG CACGACCTGG AGGACGCCCT CCACTCCGGC
GCCGTCGCGC CGGAGGCCCT GCGCGACCCC GCCGAGCGCC GTGAGGTCTG CGCGACCACC
CGCGCCTGGT ACGCCCCGGA GGCCGAGCCC GGCGAGCTGG AGGACCTCTT CGGCCGCCTG
GTCGCCCAGC CGCTCTGGCC CCGCCACTTC GACGGCTCGC TCGCCGCCCT CGCCGCGATC
AAGGGCCTCA CCAGCTCGCT CATCGGCCAT CTCTGCCGGT CCGCGCAGAT CGCCACCCGG
GAGTCCTACG GCCCGGCCGC GGGACGCTAC AGCGCCGACC TGATCGTGCC CAGGGCCACC
CGCCTGGAGT GCGCCCTGCT GAAGGGGCTC ACCGCCCACT ACGTGATGAC CAGGGACGCC
CACAACGCCA ACCAGGCCAG GCAGCGGGAG CTCATCCACG ACCTGGCCCA CCTGATCATG
CTGGGTGCCC CGGGCACGCT GGAGCCCGCC CTCCGGCCCT CCTTCGTCAA GGCCGGGAGC
GACGCCGGGC GGCTCCGCGT GGTCGTCGAC CAGATCGCCT CGCTCACCGA CACCTCCGCG
GTCACCTGGC AGCGGCGCCT GTCCGGACGC TGA
 
Protein sequence
MHGYDEHDQA RWVPEPPGNP ERGPFERDRA RVLHSAGLRR LAAKTQVVGP GETLGSGQHI 
PRTRLTHSLE CAQVGREMGQ SLGRDPDLME TACLAHDLGH PPFGHNGETA LNELAAGCGG
FEGNAQSLRL LTRLEAKVLT EDGRSAGLNL TRASLDAAVK YPWTRETSPK YCAYGDDMAV
FEWIREGAPE GRVSFEAQIM DWADDVAYSV HDLEDALHSG AVAPEALRDP AERREVCATT
RAWYAPEAEP GELEDLFGRL VAQPLWPRHF DGSLAALAAI KGLTSSLIGH LCRSAQIATR
ESYGPAAGRY SADLIVPRAT RLECALLKGL TAHYVMTRDA HNANQARQRE LIHDLAHLIM
LGAPGTLEPA LRPSFVKAGS DAGRLRVVVD QIASLTDTSA VTWQRRLSGR
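
The gene and protein sequences above can be related to each other by translating codons. The sketch below is not the tool that produced this record. It builds the standard codon table, whose amino acid assignments translation table 11 is understood to share (table 11 differs mainly in which codons may initiate translation). The helper names `clean`, `translate` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Standard codon table in TCAG order; '*' marks a stop codon.
# Assumption: table 11 uses these same amino acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group bases into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons from the first base; a trailing partial codon is ignored."""
    seq = clean(seq)
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks and the final line of the gene sequence above.
first_blocks = "ATGCACGGTT ATGACGAGCA CGATCAGGCC"
last_blocks = "GTCACCTGGC AGCGGCGCCT GTCCGGACGC TGA"

print(translate(first_blocks))  # MHGYDEHDQA
print(translate(last_blocks))   # VTWQRRLSGR*
```

The two translated fragments match the start and end of the protein sequence, with the terminal TGA read as a stop. Passing the full gene sequence to `gc_fraction` would give a value to compare with the listed GC content.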