Gene Sros_1971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1971 
Symbol 
ID8665253 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp2115486 
End bp2116736 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content67% 
IMG OID 
ProductCoenzyme F420-dependent N5 N10-methylene tetrahydromethanopterin reductase-like protein 
Protein accessionYP_003337702 
Protein GI271963506 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTCT CGATATTCCT CAACCCGCAG ATCCCCGGCT CCGGATACTC CAACGAGGAG 
AACGCGGTCG CCAAGCGGCC GATCGGGCGG GACGTCGAGT CCTATCAGGC GCTGCTCCAC
GAGGTCCGCG AGATCGCGAT CCACGCCGAC CAGATCGGCT TCGACGCGCT GATGATGACC
GAGCACCACT TCCACTCCGA GGGCTTCGAG TTCTCGGTGA ACCCGCTGAT GTTCCTCACC
GACCTGGCGG CCAGGACCGA GCGCATCCTG CTGGCCCCGC TGGGCATCGT GCTGCCCGCC
TGGGACCCGA TCAGGGCGGC CGAGGACGTG GCCCTTCTGG ACCAGTTCAG CAAGGGACGG
CTCCGCCTGG GCGTGGCCCG CGGCTACCAG AACCGGTGGA TGAACGTGCT GGGCCAGCGC
TGGCAGGCCT CCGCCGCCCG CTCCGACGGC TCCAAGAGCG ACACCCGCAA CTTCGACGTC
TTCGGCGAGG TCCTGAAGAT CATGAAGATG GCGTGGACCC AGGACACGCT GCGCTACAAG
TCCGACGTGC TGGACTACGA GGTGCCCGCG CCGTTCGACG GCATCGAGGG CTGGCCCGCG
CTGGAGTGGA CCCAGAAGTT CGGCGCGCCG GGCGAGGTGG ACGACCAGGG CAGGATCCAT
GCCGTCTCGG TCGGGCCCAA GCCGTACCAG TATCCGTATC CGGAGCTGTG GCAGCCGTTC
ACCATCTCCG ACCGCTCGGT GATCCGGGCC GCGCAGGAGG ACATCCTGCC GTGGATGTTC
ACCCCGAACC CGGACGAGCA CGCCGCCAAG GCCAAGCTCT ACCAGGAGGA GTCGGCCAAG
TGCGGCCGCG ACTACAAGCT GGGCGAGCAC ACCGGCATCC TCAAGATCGT CGGCATGGCC
GACACCCGCG AGGAGGCCAT CGCCACCTAC GGCAAGAGCA TGCAGAAGGA CTTCGCCGCC
TTCTTCGGCC CGTTCGGCTA CCTGGAGGTC CTCCGCAAGA AGGAGGACGA CAGGCACCAG
CCGATCAGCC CGGAGAAGGG CGACTACAAG CGCATGAACG AGGTCGAGAT GGCCCTGCTG
GGCGGTCCCG ACGACGTCAA GCGCGGCATC CAGCGCATGC TCGACCGGAT GCCCGACCTG
GAGTGGTTCG GCCTGTTCAT GCAGGGACAG CAGGGGGTCC TCCCCCTCGA CACCGTCAAG
CGCAACCTCG AACTCTTCGC CACCAAGGTC ATCCCCGAGT TCTCCGACTG A
 
Protein sequence
MKFSIFLNPQ IPGSGYSNEE NAVAKRPIGR DVESYQALLH EVREIAIHAD QIGFDALMMT 
EHHFHSEGFE FSVNPLMFLT DLAARTERIL LAPLGIVLPA WDPIRAAEDV ALLDQFSKGR
LRLGVARGYQ NRWMNVLGQR WQASAARSDG SKSDTRNFDV FGEVLKIMKM AWTQDTLRYK
SDVLDYEVPA PFDGIEGWPA LEWTQKFGAP GEVDDQGRIH AVSVGPKPYQ YPYPELWQPF
TISDRSVIRA AQEDILPWMF TPNPDEHAAK AKLYQEESAK CGRDYKLGEH TGILKIVGMA
DTREEAIATY GKSMQKDFAA FFGPFGYLEV LRKKEDDRHQ PISPEKGDYK RMNEVEMALL
GGPDDVKRGI QRMLDRMPDL EWFGLFMQGQ QGVLPLDTVK RNLELFATKV IPEFSD